AD  74x207 


I 


SEQUENTIAL  MULTIVARIATE  QUALITY  CONTROL  TESTS 
USING  TOLERANCE  REGIONS 


Ronald  L.  Boase  and  John  E.  Walsh 


Technical  Report  No.  lift 
Department  of  Statistics  ONR  Contract 


Reproduction  in  whole  or  in  part  is  permitted 
for  any  purpose  of  the  United  States  Government 

Reproduced  by 

NATIONAL  TECHNICAL 
INFORMATION  SERVICE 

Sprinoheld,  Va  22151 

This  document  has  been  approved  for  public  release 
and  sale;  its  distribution  is  unlimited. 


DEPARTMENT  OF  STATISTICS 
Southern  Methodist  University 
Dallas,  Texas  75222 


M.Vf  s  v:''  I'h 


,v 


LAfflEl  NOTICE 


THIS  DOCUMENT  IS  BEST 
QUALITY  AVAILABLE.  THE  COPY 
FURNISHED  TO  DTIC  CONTAINED 
A  SIGNIFICANT  NUMBER  OF 
PAGES  WHICH  DO  NOT 
REPRODUCE  LEGIBLY. 


:  ::rin;:;  !■' Al,  MULTIVARIATE  0UAL1TY  control  tests 
Uf.IMO  i’CLEEANCF.  REGIONS 
bv 

Ronald  I  .  Roaso  and  John  !". .  Walsh 


Technical  Report  Mo.  110 
Department  of  Statistics  ONE  Contract 

Anril  1,  197 1 


Research  sponsored  by  the  Office  of  Naval  Research 
Contract  N00014-68-A-0515 
Project  NR  042-260 


Reproduction  in  whole  or  in  part  is  permitted 
for  any  purpose  of  the  United  States  Government. 


This  document  has  been  approved  for  public  release 
and  sale;  its  distribution  is  unlimited. 


DEPARTMENT  OF  STATISTICS 
Southern  Methodist  University 


DOCUMENT  CONTROL  DATA  ■  R  &  D 

^K-firtty  clajM/n  a(«)i  1  of  ?»?/«»,  of  ijft'fMn  (  /im/  indexing  attnt  tn  ti  on  must  be  entered  wht*.  the  overall  report  in  c  testified) 


'  .i  a  *  L  ?  i  V.  r  »  lorp.w^rr  j  2*.  REPORT  SECURITY  CLASSIFICATION 

UNCLASSIFIED 

OUTHKRN  METHODIST  UNIVERSITY 


2b.  CROUP 

UNCLASSIFIED 


HF.PORI  TITLE 


Sequential  Multivariate  Quality  Control  Tests  Using  Tolerance  Regions 


*  I'tsC’iPTiVE  notes  fTjpe  o/  wpoff  a/id  incfiMicf  dale*) 

:  •.■clinical  Report _ 

*>  -  t  MORi.ii  f  First  name,  middle  initial,  last  name) 

Ronald  L.  Lease 
,’rhn  E.  Walsh 


t>  SEPOR.  DATE 

April  1,  1'tTi 


t>*  CON’R*CT  or  CRiNT  NO 

:;oOC14-b8-A-0515 

PROJtC  T  NO 

NR  042-260 


jrn.  TOTAL  NO.  or  PACES  7b.  NO  OF  REES 

86  19 


9«.  ORIGINATOR* l  REPORT  NUMBER'S) 


9b-  other  REPORT  nOIS)  (Any  ofher  numbera  that  may  be  assignsJ 
this  report) 


io  distribution  statement 

This  document  has  been  approved  for  public  release  and  sale;  its  distribution  is 
■unlimited.  Re-production  in  whole  or  in  part  is  permitted  for  any  purpose  of  the 
United  States  Government. 


t  SUPPLEMENTARY  notes 

1  2  SPONSORING  MILI  T  ARY  ACTIVITY 

OFFICE  OF  NAVAL  RESEARCH 

H  ABSTRAC  T 

Two  new  developments,  multivariate  sequential  significance  tests  and  a  method  of 
forming  multivariate  two-sample  tolerance  tests,  are  proposed.  The  sequential  signif¬ 
icance  test  considered  is  a  fixed-length  succession  of  two-sample  subtests  where  each 
subtest  reuses  some  or  all  of  the  data  for  preceding  subtests.  By  proper  choice  of 
the  subtest  statistics  and  use  of  a  permutation  basis  the  subtests  are  made  independent 

The  class  of  multivariate  two-sample  tolerance  tests  developed  by  the  proposed 
method  are  directly  applicable  as  subtests  in  the  sequential  significance  tests.  The 
proposed  method  is  based  on  a  new  technique  of  constructing  tolerance  regions  for  the 
two-sample  problem.  Subject  to  certain  mild  limitations  the  analyst  may  actually 
look  at  the  combined  observed  vector  values  in  order  to  construct  the  desired  tolerance 
regions.  This  advantage  can  be  used  effectively  in  choosing  the  shapes  of  the  toler¬ 
ance  region?*  so  as  to  emphasize  the  alternative  hypotheses  associated  with  the  selectee 
test  statistic.  The  only  requirements  are  that  the  joint  null  distribution  of  the 
combined  data  be  a  symmetric  function  and  that  the  observations  be  such  that  the  pro¬ 
posed  construction  process  is  unique  with  probability  one. 
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Proposed  are  a  class  of  sequential  significance  tests  of  limited 
length  and  a  method  oC  forming  multivariate  two  -semnle  tolerance  tests. 
The  sequential  significance  test  studied  is  conducted  as  a  finite  suc¬ 
cession.  of  certain  multivariate  two-samnlc  permutation  sub tests .  The 
two-sample  tolerance  tests  produced  by  the  proposed  method  satisfy  the 
required  properties  as  suhtests.  The  sequential  significance  test 
using  these  two  sample  tolerance  suhtests  can  he  practically  applied  m 
multivariate  quality  control. 

The  sequential  significance  tests  are  discussed  in  Chapter  II. 

The  data  considered  are  independent  sets  of  random  samples  which,  under 
the  null  hypothesis,  are  from  the  same  multivariate  population.  Two- 
sample  subtests  are  performed  in  a  sequential  manner  where  each  subtest 
reuses  all  or  part  of  the  data  considered  in  previous  suhtests.  The 
previous  data  used  by  each  subtest  may  he  determined  by  random  selection 
from  the  totality  of  data  considered  by  the  preceding  subtest.  The  over¬ 
all  test  is  significant  whenever  anv  one  sub test  is  significant  and  is 
not  significant  when  all  suhtests  are  not  significant  and  a  maximum 
number  of  subtests  have  been  made. 


Permissible  subtests  are  permutation  tests  whose  statistics  are 
symmetric  in  the  totality  of  the  previous  data  used.  By  considering 


only  this  class  the  subtests  are  independent,  thus  the  significance 
level  of  each  subtest  is  not  affected  by  the  outcomes  of  preceding 
sub tests. 

Some  of  the  desirable  properties  of  the  sequential  significance 
test  for  quality  control  uses  are:  (1)  the  data  considered  may  be 
multivariate,  (2)  the  test  permits  legitimate  reuse  of  previous  data, 

(3)  the  permissible  subtests  are  independent  providing  accurate  evalua¬ 
tion  of  significance  levels,  (f*)  the  random  selection  of  previous  data 
at  each  subtest  level  can  be  effectively  used  to  emphasize  the  more 
recent  data,  and  (5)  the  permutation-randomization  approach  yields 
subtests  that  are  generally  applicable. 

In  Chapter  III  the  new  proposed  method  of  forming  multivariate 
two-sample  tolerance  tests  is  introduced.  This  method  yields  tests  that 
have  a  permutation  basis  and  satisfv  the;  requirements  for  subtests  in 
the  sequential  significance  tests  proposed  in  Chapter  II.  A  well  known 
existing  method  is  shown  to  be  a  special  case  of  the  proposed  method. 

The  data  required  by  the  proposed  method  need  not  be  independent  random 
samples  but  must  have  a  symmetric  joint  null  distribution.  Also,  any 
univariate  two-sample  tolerance  test  can  be  considered  as  a  multivariate 
two-sample  tolerance  test.  These  include  all  run  and  rank  tests. 

The  construction  process  of  forming  tolerance  regions  for  the 
proposed  method  is  outlined  in  Chapter  IV.  This  process  is  a  system¬ 
atically  stage.d  procedure  for  establishing  a  set  of  tolerance  regions 
for  the  two-sample  prohlem.  Certain  symmetric  information  is  available 
for  use  at  each  stage.  This  information  basically  includes  knowledge 
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of  the  combined  observed  vector  values  as  long  as  they  are  not  identi¬ 
fied  with  the  population  from  which  they  were  taken.  This  information 
accumulates  as  the  process  continues  providing  an  excellent  source  for 
determining  the  shapes  of  the  desired  tolerance  regions.  Thus,  the 
desired  tolerance  regions  can  be  constructed  with  the  goal  of  making 
the  selected  test  statistic  as  significant  as  possible. 

Some  suggested  techniques  for  applying  the  proposed  method  of 
forming  multivariate  two-sample  tolerance  tests  are  presented  in  Chapter 
V.  Also  included,  are  certain  practical  considerations  for  using  the 
new  construction  process  effectively  and  for  selecting  appropriate  uni¬ 
variate  tests.  A  general  outline  of  a  suggested  operational  procedure 
is  included. 

Certain  areas  of  possible  application  of  the  sequential  signifi¬ 
cance  test  using  subtests  formed  by  the  proposed  method  are  considered 
in  Chapter  VI.  Most  of  the  discussion  is  devoted  to  applications  in  the 
medical  field. 

The  last  chapter,  Chapter  VII,  contains  statements  of  the  basic 
theory  verifying  the  results  claimed  for  the  proposed  method  of  forming 
multivariate  two-sample  tolerance  tests.  The  important  results  are 
stated  in  the  form  of  a  theorem  and  corollary.  Because  of  its  unusual 
length,  the  proof  of  the  theorem  has  been  relegated  to  the  appendices. 
All  other  results  claimed  are  verified  in  the  discussion. 


CHAP! HR  II 


QUALITY  CONTROL  TESTS 

Sequential  random-'  .ation  tests  for  univariate  one-way  analysis  of 
variance  have  been  developed  by  Walsh  f 1 5 ]  and  [16].  Presented  here,  is 
an  analogous  extension  of  Walsh's  tests  using  multivariate  data.  Such 
tests  possess  desirable  properties  making  them  directly  applicable  to 
quality  control  uses. 

A  class  of  sequential  significance  tests  which  consist  of  a  pre¬ 
specified  number  of  subtests  is  developed.  Each  subtest  in  the  sequence 
reuses  all  or  part  of  the  total  data  used  in  the  acceding  subtest  in  a 
manner  which  establishes  independence  among  all  subtests. 

The  data  are  taken  in  sets  representing  independent  (finite)  random 

samples^  of  multivariate  observations  which,  under  the  null  hypothesis, 

2 

are  from  the  same  unknown,  but  partially-continuous  distribution.  Each 

subtest  is  a  two-sample  test  using  as  one  population  sample  (previous 

3 

data)  a  set  of  data  vectors  randomly  chosen  from  the  totality  of  ob¬ 
servation  data  vectors  used  in  the  preceding  subtest  and  as  its  second 

■^It  suffices  to  require  the  combined  observations  to  have  a  sym¬ 
metric  joint  null  distribution. 

2 

A  random  vector  is  defined  to  have  a  partially-continuous  distri¬ 
bution  if  at  least  one  component:  of  the  random  vector  has  a  continuous 
marginal  distribution. 

3 

The  previous  data  for  any  two-sample  subtost  may  then  include  the 
totality  of  observation  data  used  in  the  previous  subtost. 

-A- 


population  sample  (new  data)  one  ol  the  remaining  unused  data  sets.  The 
subtests  are  performed  sequentially  until  either  significance  is  obtained 
at  a  subtest  level  or  a  specified  maximum  number  of  subtests  have,  been 
made.  Significance  for  the  overall  test  is  obtained  only  when  a  subtest 
in  the  sequence  proves  significant.  Thus,  the  overall  test  will  not  be 
significant  if,  and  only  if,  all  subtests  in  the  sequence  arc  not  sig¬ 
nificant.  Exact  significance  levels  can  be  obtained  by  using  appropriate 
randomization-permutation  probability  models  and  subtest  statistics 
possessing  a  special  proportv  which  insure  independence  betv/een  subtests. 

Perhaps  the  most  desirable  feature  of  these  tests  is  their  ability 
to  legally  use  (in  a  orobabilistic  sense)  data  of  preceding  subtests. 

The  outcomes  of  the  preceding  subtests  in  many  similar  sequential  tests 
■vroduces  a  conditional  effect  on  the  significance  level  of  succeeding 
sub  tests;  however,  in  the  tests  studied  here,  the  subtests  are  indepen¬ 
dent  and  no  such  conditional  effects  exist  among  them.  Therefore,  if 
ai,O2,...,0»  0  the  significance  levels  of  the  k  subtests  composing 

the  overall  test,  the  significance  level,  ra,  of  the  overall  test  can 
be  computed  directly: 

k 

a  =  1  -  n  (1  -  a.). 

i=l  1 

Another  feature,  which  is  highly  desirable  in  applied  sequential  quality 
control  tests,  is  the  ability  to  maintain  a  limited  control  of  the  em¬ 
phasis  placed  on  preceding  data  sets  at  each  subtest  stage.  This  is 
accomplished  bv  properly  selecting  the  sizes  of  each  new  data  set  used 
in  the  sequence  end  the  size  of  the  previous  data  set  obtained  by  random¬ 
ization  at  each  subtesf.  stage.  Repeated  randomization  of  the  preceding 


data  will  emphasize  the  most  recent  <!nr.->  sets,  while  no  randomization, 
that  is, using  all  the  preceding  data,  will  tend  to  deemphasize  the  most 
recent  data  sets. 

A  third  feature  is  that  the  randomization-permutation  model  yields 
subtests  of  general  applicability  which  may  be  one  or  two-sided  tests 
and  can  be  oriented  toward  many  forms  of  the  alternative  hypothesis. 

The  randomization  contribution  to  the  model  insures,  under  the  null 
hypothesis,  that  the  observations  selected  as  previous  data  at  each  sub¬ 
test  stage  constitute  a  random  sample  from  the  population  representing 
the  totality  of  data  used  in  the  previous  subtest.  If  no  significance 
is  obtained  at  the  subtest  stage,  the  previous  and  new  data  sets  used 
are  combined  and  represent  a  random  sample  from  the  population  yielding 
the  combined  data.  This  combined  date  set  becomes  the  data  available 
for  randomiz  ition  (if  any)  defining  the  previous  data  set  for  the  next 
subsequent  subtest.  The  process  is  continued  until  either  significance 
is  obtained  at  some  subtest  level  or  a  specified  number  of  subtests  have 
been  made. 

The  permutation  model  is  used  to  establish  the  conditio1'  1  prob¬ 
ability  spaces  on  which  the  distribution  of  each  subtest  statistic  is 
determined.  If  the  observed  vectors  are  ordered  in  some  definite  but 
arbitrary  manner  (e.g.  the  order  in  which  they  were  obtained)  then  the 
sample  space,  induced  by  the  permutation  model,  constitutes  the  set  of 
all  permutations  of  the  observed  vectors.  Under  the  null  hypothesis, 
the  probability  of  any  permutations  is  the  same.  The  permutation  sample 
space  associated  with  iwo-samplo  problem  can  be  reduced  by  considering 
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the  set  of  all  possible  assignments  of  the  ordered  observed  vectors  into 
two  sets;  one  of  size  equal  to  the  new  data  set  and  the  other  whose  size 
corresponds  to  the  pr  vious  data  set.  Then,  under  the  null  hypothesis, 
all  possible  assignmt nts  made  in  this  manner  are  equally-likely . 

Now,  any  function,  symmetric  in  the  totality  of  observed  vectors, 
is  clearly  a  constant  on  all  points  of  the  associated  permutation  sample 
space.  That  is,  under  any  hypothesis,  this  function  is  a  constant  with 
probability  one.  Therefore,  the  function,  a  statistic,  is  indenendent 
of  any  other  statistic  defined  on  the  same  permutation  sample  space. 

This  fact  motivates  the  method  used  for  selecting  appropriate  subtest 
statistics  having  the  property  that  the  subtest  significance  level  is 
not  conditionally  affected  by  the  outcomes  of  previous  subtests. 

For  each  two-sample  subtest  consider  the  class  of  statistics  which 
are  symmetrical  in  the  totality  of  observed  vectors  defined  by  the  pre¬ 
vious  data  set.  Then,  for  any  fixed  set  of  observed  vectors  constituting 
a  possible  ordering  of  the  new  data  set,  the  value  of  such  a  statistic 
remains  unchanged  over  all  permutations  of  the  observed  vectors  in  the 
previous  data  set.  Also,  for  any  fixed  ordered  set  of  observed  vectors 
in  the  new  data  set,  this  statistic  is  defined  on  the  permutation  sample 
space  obtained  from  the  previous  data  set  and  on  any  permutation  sample 
space  constructed  on  a  set  of  observed  vectors  contained  in  or  containing 
the  previous  data  set. 

in  order  to  verify  that  this  class  of  statistics  possess  the  property 
that  in  the  sequential  process  each  subtest  statistic  is  independent  of 
the  results  of  the  preceding  subtests,  two  mutually  exclusive  cases  are 


are  considered. 
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First  consider  a  sequence  of  subtests  where  at  each  subtest  stage 
the  previous  data  was  taken  to  be  the  totality  of  data  used  in  the  pre¬ 
ceding  subtest.  That  is,  none  of  the  previous  data  sets  defined  at  each 
subtest  stage  were  obtained  by  randomization.  The  totality  of  data  used 
for  each  subtest  is  then  a  proper  subset  of  the  totality  of  data  used  in 
the  next  subsequent  subtest.  Thus,  any  permutation  of  the  observed  vec¬ 
tors  that  could  occur  for  any  preceding  subtest  corresponds  to  a  subtest 
of  the  permutations  of  the  observed  vectors  in  the  previous  data  set. 

For  any  fixed  order  of  observed  vectors  in  the  new  data  set  the  subtest 
statistic,  by  choice,  is  a  constant  over  all  permutations,  thus  all  sub¬ 
sets  of  permutations,  of  the  observed  vectors  in  the  previous  data  set. 
Therefore,  the  subtest  statistic  is  a  constant,  with  probabilitv  one,  on 
each  permutation  sample  space  of  the  preceding  subtests,  and  is  inde¬ 
pendent  of  the  permutation  observed  for  each  preceding  subtert.  Since 
the  outcome  of  a  subtest  is  determined  by  the  actual  permutation  of  vec¬ 
tors  observed,  then  the  subtest  statistic  is  independent  of  the  outcome 
of  all  preceding  subtests. 

Now,  consider  a  subtest  whose  previous  data  ret  was  obtained  by 
randomization  -  randomly  selecting  a  subset  from  the  totality  of  data 
used  in  the  preceding  subtest.  Since  the  new  data  sets  obtained  after 
randomization  are  independent  of  the  data  sets  used  prior  to  the  random¬ 
ization,  it  is  only  necessary  to  show  that  the  previous  data  set  is  in¬ 
dependent  of  all  outcomes  of  the  preceding  subtests.  To  verify  this,  it 
suffices  to  consider  only  those  preceding  subtests  occurring  after  the 
most  recent  previous  subtest  using  randomization,  for  induction  can  be 
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used  to  justify  the  remainder  of  the  assertion.  Verification  then  fol¬ 
lows,  since  the  totality  of  observation  vectors  in  the  preceding  subtest 
is  not  affected  by  any  permutations  that  could  occur  or  any  subsets  of 
them.  Thus,  the  :  ubtest  statistic  is  independent  of  the  outcomes  of 
these  preceding  subtests. 

If  randomization  is  used  in  the  next  following  subtest  to  obtain 
its  previous  data  set  the  above  verification  holds.  However,  if  the 
next  subsequent  subtest  does  not  use  randomization,  the  situation  is 
essentially  the  same  as  that  cited  in  the  first  case.  The  proof  then 
follows  by  induction. 

The  sequential  significance  test  description  given  by  Walsh  [16] 
for  the  univariate  case  is  analogous  for  the  multivariate  case.  However 
in  this  paper,  the  number  of  observed  vectors  in  each  new  data  set  is 
permitted  to  be  one  or  more  provided  the  first  data  set  used  is  of  suf¬ 
ficient  size  to  insure  that  the  desired  significance  levels  of  all  sub¬ 
tests  and  the  overall  test  can  be  obtained  (or  approximately  obtained). 
In  like  manner,  for  exact  and  approximate  permutation  tests,  the  deter¬ 
mination  of  sharp  lower  bounds  for  the  subtest  significance  levels,  and 
the  considerations  on  the  sample  sizes  used  in  each  subtest  discussed  by 
Walsh  also  hold  for  the  multivariate  use  as  well.  This  material  has 
been  thoroughly  and  clearly  presented  in  the  above  reference. 

To  establish  a  sequential  significance  test,  having  all  the  proper 
ties  outlined  above,  would  first  require  finding  appropriate  two-sample 
multivariate  subtests.  These  subtests  not  only  should  be  selected  to 


emphasize  the  alternative  hypotheses,  but  be  feasible  in  application. 
That  is,  under  the  permutation  model,  the  null  distribution  of  the  sub¬ 
test  statistic  should  either  be  well  approximated  by  some  known  distri¬ 
bution  or  easily  determined.  The  two-sample  tests  considered  in  this 
paper  are  based  on  tolerance  regions.  It  will  be  shown  that  a  two-sam¬ 
ple  tolerance  test  has  a  permutation  basis  end  the  associated  test  sta¬ 
tistic  is  symmetric  in  the  observations  in  both  samples  separately. 

Thus  all  two-sample  tolerance  tests  are  permissible  as  subtests. 


CHAPTER  III 


MULTIVARIATE  TWO-SAMPLE  TOLERANCE  TESTS 

A  new  method  of  forming  two-sample  tolerance  tests  is  proposed.  As 
an  introduction  to  this  method  an  existing  method  is  presented  first. 

The  existing  method  is  a  special  case  of  the  new  proposed  method.  The 
basic  difference  between  them  is  the  manner  in  which  tolerance  regions 
are  formed.  The  new  method  is  shown  to  overcome  several  of  the  major 
disadvantages  common  to  the  existing  method.  Both  methods  yield  tests 
that  are  suitable  as  subtests  in  the  quality  control  tests  presented  in 
the  preceding  chapter. 

The  same  basic  philosophy  is  used  in  both  methods  to  form  two-sample 
tolerance  tests.  A  test  is  fundamentally  established  on  a  set  of  toler¬ 
ance  regions.  The  number  of  observations  from  one  population  falling 
within  each  tolerance  region  is  counted.  These  frequency  counts  are  then 
used  to  determine  the  outcome  of  the  test. 

The  existing  method  C  [  19 3 » [  1  ],  and  [  3  ])  requires  that  the  data 
be  two  independent  random  samples.  One  of  these  samples  is  used  to  con¬ 
struct  a  set  of  disjoint  nonparametric  tolerance  regions.  The  other 
sample  is  reserved  for  determining  the  region  frequency  counts.  The  con¬ 
struction  process  used  to  establish  the  tolerance  regions  ( [17  ] » [13  ] » [12 )  > 
and  [9  ])  is  a  systematically  staged  procedure  (referred  to  here  as  the 
"standard  one-sample  process"). 


I 


-12- 

In  the  first  stage  Lite  sample  space  it  partitioned  into  two  disjoint 
tolerance  regions,  called  blocks.  This  is  accomplished  by  choosing  some 
real-valued  function  whose  null  distribution  is  continuous  and  selecting 
some  order  statistic  on  the  set  of  function  values  defined  on  the  sample. 
These  choices  can  be  based  on  any  independent  information  that  is  avail¬ 
able  prior  to  taking  the  observations.  The  value  of  this  order  statistic 
and  the  observed  vector  yielding  it  is  all  the  additional  information 
permitted  ([6], [7], [8],  and  [9  ])•  The  function  equated  to  this  given 
value  of  the  chosen  order  statistic  defines  a  cut  on  the  sample  space 
producing  two  blocks.  This  partitioning  also  separates  the  sample  into 
•vo  conditionally  independent  subsamples  [3].  Using  this  limited  in- 
bmr  '  on ,  one  of  the  two  blocks  formed  is  considered  for  division  in  the 
second  stage.  The  above  process  is  repeated  only  for  the  subsample  of 
fr-'c— .”t r'ons  associated  with  the  chosen  block.  However,  the  function 
or";  dcred  "X  the  second  stage  may  differ  from  the  function  selected  at 
rl--t  stage  level  but  its  null  distribution  must  also  be  continuous. 
The  chosen  block  is  then  partitioned  into  two  new  blocks.  Thus,  by  the 
end  of  the  second  stage  three  disjoint  blocks  have  been  formed.  Again 
the  only  new  permissible  knowledge  are  the  values  of  the  order  statistics 
cad  the  observed  vectors  yielding  them.  The  process  is  continued  until 
the  desired  number  of  tolerance  regions  or  blocks  has  been  obtained.  The 
content  of  the  tolerance  regions  can  be  determined  by  the  number  of  unused 
observations  lying  within  them.  If  the  process  were  continued  until  all 
observations  were  used  to  define  cuts  (each  time  producing  two  new  blocks) 
tic  resulting  blocks  are  called  basic  blocks  or  statistically  equivalent 
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blocks.  Thus,  the  b lc  I'1’''  ••••’!*•  earlier  in  the  process  woul  d  consist 

of  a  fixed  number  of  basic  blocks.  A  more  common  measure  of  tolerance 
region  content  is  the  number  of  basic  blocks  contained  within  the  toler¬ 
ance  region. 

Finally,  after  the  desired  number  and  basic  block  contents  of  the 
tolerance  regions  have  been  established  on  the  first  sample,  the  corres¬ 
ponding  set  of  frequency  counts  on  the  observations  in  the  second  inde¬ 
pendent  sample  are  made  yielding  the  outcome  of  the  test.  The  null  dis¬ 
tribution  of  the  test  statistic  is  determined  from  the  joint  null  dis¬ 
tribution  of  the  block  frequency  counts.  This  latter  distribution  has 
been  established  ([18]  and  [19]). 

All  tests  based  on  this  method  have  a  permutation  basis.  This 
fact  is  shown  by  the  corollary  given  in  Chapter  VII.  The  corresponding 
test  statistics  are  symmetric  in  the  totality  of  sample  observations  on 
which  the  blocks  were  defined;  also,  they  are  symmetric  in  the  totality 
of  sample  observations  used  to  determine  the  block  frequency  counts. 

This  means  that  any  two-sample  test  obtained  by  this  method  can  be  used 
as  a  subtest  in  the  previous  chapter.  Furthermore,  either  the  "previous" 
or  "new"  data  set  can  be  used  to  define  the  tolerance  regions. 

Unfortunately,  this  existing  method  has  a  major  disadvantage; 
namely,  the  limited  freedom  in  selecting  the  shapes  of  the  tolerance 
regions  so  as  to  emphasize  the  alternative  hypotheses  of  interest.  At 
each  stage  in  the  above  process  only  a  small  amount  of  knowledge  was 
available  on  which  to  choose  the  shapes  of  the  cutting  functions.  No 
knowledge  or  consideration  of  the  second  sample  data  was  used.  By  com¬ 
bining  both  sets  of  observations  and  considering  only  certain  permissible 
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in  format  ion  (mainly  symmetric)  a  vast  amount  of  knowledge  can  be  obtained 
on  which  to  base  tho  .  han<-.«;  of  tol.-onee  regions .  This  concept  is  ex¬ 
plored  by  tho  next  •nvn'hod. 

The  proposed  new  method  of  forming  two-sample  tolerance  tests  re¬ 
quires  only  that  the  t wo  sets  of  data  have  a  symmetrical  joint  null  dis¬ 
tribution.  both  sets  of  observations  are  used  to  construct  a  set  of  dis¬ 
joint  tolerance  regions  on  one  of  the  two  sets  of  observations.  The 
process  of  constructing  the  tolerance  region  is  similar  to  the  standard 
one-sample  process  hut  allows  knowledge  of  all  observed  vectors  excluding 
their  set  association  <  i .  o .  knowledge  of  the  observations  within  each  of 
the  two  original  dal  a  set.  j s  forbidden).  This  construction  procedure 
will  be  referred  to  as  ‘be  "generalized  Mock  construction  process  for 
the  two- sample  problem"  c-  simply  as  the  p*  process. 

First,  one  of  : he  (wo  data  set;  is  designated  as  the  set  on  which 
the  tolerance  region:;  ( o-  blocks)  will  he  formed  [This  set  will  be  referred 
to  as  the.  "designated  set"! .  Trior  t:o  the  format  ion  of  any  blocks  on  the 
sample  space  a  certain  .amount  of  information  is  available.  This  includes 
all  information  which  is  symmetric  with  respect  to  the  totality  of  random 
vectors  yielding  the  combined  set  of  observations.  Thus,  knowledge  of 
the  observed  vector  values  is  permitted  as  long  as  all  vectors  arc  not 
associated  (or  identified)  with  cither  of  the  two  data  sets.  This  means 
that  the  totality  of  the  unidentified  observed  vectors  can  be  "looked-at" 
and  treated  numerically  and/or  graphically  in  any  manner.  Any  function 
defined  on  any  subset,  of  the  combined  set.  of  unidentified  observations 
would  be  symmetric  information  permitted.  All  of  this  information  plus 
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any  independent  information  available  prior  to  taking  the.  observations 
can  then  be  used  to  select  the  first  stage  cutting  function.  This  func¬ 
tion  must  be  real-valued  and  must  either  have  a  continuous  null  distri¬ 
bution  or  be  selected  in  such  a  way  as  to  guarantee  that  it  will  never 
pass  through  more  thn;.  one  observed  vector  for  all  the  values  in  its 
range.  The  location  of  the  cut  is  then  determined  by  selecting  some 
order  statistic  on  the  set  of  function  values  defined  on  the  designated 
..et.  Oi.ly  the  value  of  this  order  statistic  and  tin.:  observed  vector 
yielding  it  can  he  identified.  Tims,  one  of  the  observed  vectors,  the 
one  associated  with  the  cut,  is  identified  and  all  other  observations 
remain  unidentified.  The  additional  information  that  now  becomes  avail¬ 
able  includes,  the  two  sets  of  unidentified  observations  falling  within 
the  two  new  blocks  formed  and  all  information  which  is  symmetric  with 
respect  to  both  sets  of  random  vectors  yielding  these  two  sets  of  un¬ 
identified  observations.  Then  a  block  is  chosen  for  the  next  stage 
division  (it  must  contain  at  least  one  observation  from  the  designated 
set).  A  decision  must  be  made  at  this  time  to  cither  reserve  the  remain¬ 
ing  block  for  possible  future  division  at  some  later  stage  in  the  B* 
process  or  to  never  divide  it  at  any  stage.  If  the  latter  decision  were 
made,  all  observations  lying  within  the  remaining  block  can  be  identified 
with  the  original  data  sets.  All  this  information  is  then  used  to  select 
the  next  cutting  function.  This  process  is  continued  until  the  desired 
tolerance  regions  have  been  formed.  The  joint,  null  distribution  of  the 
block  frequency  counts  at  any  stage  is  the  same  as  the  corresponding 
distribution  determined  bv  the  standard  one-sample  process  (see  Chapter 
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VII).  Thus"* .  ’  *  *  '>■  ;•’*  .V  !  1  to  the  la  ,r  possible  stage 

the  resultin',  h  *  .-■>  ;;k a1*-;  stat ; .  tically  ••■pi  5  valent  blocks  in  the  sense 
that  the.  joint  null  rl  i  stribut  i  on  of  the  basic  block  frequency  counts  is 
the  same  ar  that:  determined  bv  the  standard  process* . 

In  Chapter  VII  it  is  shown  that  any  test  produced  by  this  new 
method  also  has  a  permutation  basis  and  its  associated  statistic  is  sym¬ 
metric  in  the  set  of  observations  on  which  the  tolerance  regions  were 
defined  and  on  the  set  of  observations  used  to  establish  the  block  fre¬ 
quency  counts.  Then  all  two-sample  tolerance  tests  defined  by  this  new 
method  are  usabl  -  as  sub  tests  in  Chapter  IT.  Also,  either  the  "previous" 
or  "new"  data  sets  can  he  used  to  define  the  block  frequency  counts. 

One  primary  advantage  the  P*  process  has  over  the  standard  one- 
sample  process  is  the  vast,  amount  of  information  made  available  for 
forming  tolerance  re  ions.  'ill is  advantage  allows  one  to  select  desir¬ 
able  shape;'  of  the  tolerance  regions  so  as  to  make  the  test  statistic, 
previously  chosen,  as  significant  as  possible.  This  is  equivalent  to 
emphasizing  the  a.  3  trrn.it  i'»'  hypotheses  for  which  the  test  accentuates. 

A  second  important  advantage  of  the  P*  process  is  that  it  is  not 
necessary  to  proceed  through  all  stages  once  sufficient  information  to 


The  term  "statistically  equivalent  blocks,"  originally  defined  by 
Tukey  [12],  actual lv  referred  to  a  set  of  tolerance  regions  obtained  at 
the  completion  of  the  process  described  by  the  first,  method.  This  con¬ 
cept  basically  defined  statistically  equivalent  blocks  as  a  set  of  N  +  1 
tolerance  regions,  formed  by  the  first  method,  whose  joint  coverages 
represent  tie  harycontrir  coordinates  of  a  random  point  uniformly  dis¬ 
tributed  in  an  b-d  imonr.  i  ena  1  simolex  (where,  N  denotes  ttie  sample  size). 
In  this  paper,  statist  iral  ]v  equivalent,  blocks  will  he  defined  as  a  set 
of  tolerance  regions  on  "hich  the  joint,  null  distribution  of  the  fre¬ 
quency  counts  det  erwi  iv  .!  !-■’  a  second  sample  is  the  same  as  if  it  were 
determined  by  a  set  of  statistically  equivalent  blocks  defined  by  Tukey. 


evaluate  the.  .>m  ,•  i  .  ,•  i  ,  ;;ia,!c  nv.u  i I ,  .  :  1 1 ,  >  process  b* 

allows  some  ohsorvat  i<w:  !■  :  com  1  vt-- 1  v  idea r  i  f ied  at  various  stages  . 

For  example,  i a  •  i  t  .a  >  n  Mors  e-the--  has  !kwii  designated  as  a  block 
"never- 1 o-*b''-(l i ” 1 1 or  is  a  basic  Mock,  then  all  observations  lying 
within  are  completely  identified  with  their  original  data  sets.  Thus, 
sufficient  ini  or-:' at.  ion  u'>otil  t.he  block  frequency  co’.air  may  possibly  be 
determined  earl  \  in  t  lie  process  so  as  to  conclude  the  outcome  of  the 
test . 

Another  advent  arc  t  v  the  data  need  not:  he  independent  random  samples. 
The  corresponding  re.-pii  ivrtent  is  that  rhe  joint  null  dist  ribution  of  the 
combined  data  rots  ~:ust  ’<«•  .  svmmetri  c  function. 

It  is  alri  shear  in  Charter  tv  that,  the  standard  one-sample  process 
is  a  special  ca.-e  of  ’  a  '  ’excess.  This  implies  that  the  standard 
process  ca  t  be  :■<'•••  -»v  re. mi  ring  only  that:  the  combined  data  have  a 

symmetric  joint  nu! ’  distribution  function. 

The  type:  of  test,  that  may  he  formed  by  either  method  is  considered 
next.  Anderson  f  1  ]  shews  that  any  two-sample  tolerance  test  adapted 
for  the  univariate  case  is  also  directly  applicable  for  the  multivariate 
case  using  tin'  existing  method.  Tests  based  on  multivari.te  data  are 
obtained  by  relating  each  multivariate  tolerance  region  to  a  univariate 
tolerance  region  containing  the  same  number  of  basic  blocks.  Since  the 
joint  null  distribution  of  the  block  frequency  counts  is  unchanged 
whether  the  multivariate  or  univariate  case  is  considered,  then  all  uni¬ 
variate  two-sample  tolerance  tests  are  usable.  However,  the  joint  null 
distribution  of  the  block  frequency  counts  for  the  new  method  is  the 
same  as  hat  for  the  existing  method;  thus,  univariate  two-s;mple  toler- 


ance  tests  apply  equal  1  v  well  to  the  new  me.'hod.  The  types  of  nonpara- 
metric  tests  available,  for  <  ousi  tie  rat  .ion  include  all  run  and  rank  tests 
(see  pages  34  through  80  in  reference  f 1 4 1 ) -  Some  of  these  tests  are 
described  later  in  Chapter  VI. 

A  detailed  description  of  the  proposed  P*  process  is  given  in  the 
next  chapter.  This  is  followed  by  two  chapters  devoted  to  suggested 
techniques  and  areas  of  practical  application.  The  final  chapter  estab¬ 
lishes  verification  of  all  results  claimed. 


CHAPTER  IV 


A  GENERALIZED  BLOCK  CONSTRUCTION  PROCESS,  B* 

FOR  THE  TWO- SAMPLE  PROBLEM 

The  process  presented  is  a  systematically  staged  method  of  con¬ 
structing  a  set  of  distribution-free  tolerance  regions  (blocks)  for  the 
two-sample  problem.  The  data  consist  of  two  sots  of  at  least  partially 
continuous  multivariate  observations  which  are  defined  on  the  same  sample 
space.  Under  the  null  hypothesis,  the  Combined  data  must  have  a  symmetric 
joint  cumulative  distribution  function.  The  sets  and  the  observations 
within  each  sot  are  not  required  to  be  independent.  At  each  stage  in  the 
process  specific  information  on  certain  subsets  of  the  combined  set  of 
observations  is  used  to  establish  two  new  blocks  (regions)  within  one  of 
the  blocks  previously  formed.  This  information  collectively  increases 
as  the  process  continues,  providing  a  better  basis  on  which  the  shapes 
and  locations  of  future  blocks  may  be  controlled.  This  freedom  greatly 
increases  the  ability  to  construct  blocks  so  as  to  emphasize  the  alter¬ 
native  hypotheses  -  thereby  potentially  increasing  the  power  of  any  ‘ho- 
sample  tolerance  test. 

The  first  two  stages  and  a  general  stage  are  discussed  in  detail 
stressing  the  amount  of  permissible  information  available.  Some  suggested 
techniques  for  exploiting  the  information  obtained  are  presented  in 
Chapter  V. 
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ho'_  \  1  ,X..  ,  .  .  .  ,  >  1  --  1  V|,  V A,..-,  V  1  b»»  two  sots  of 

n  n  m  in 

p-component  random  vvt  .-r.s  defined  on  a  samnlc  span?  X.  Th  .se  random 

vectors  must  he  at  least  partially  continuous  and  have  a  svmmetric  joint 

null  cumulative  distribution  function.  The  process  B*  will  be  demonstrated 

by  forming  a  set  of  bKv-ks  (tolerance  regions)  on  the  observations  on 

0  . 
n 

For  convenience ,  a  tew  basic  terms  and  symbols  will  be  defined 
and  used  through  the  remaining  text. 

Dq_fiai_i_t  i  cm _ \ 

An  observation  is  said  to  be  identified  if  it  can  be  associated 

with  the  set  of  random  vectors  which  yielded  it;  that  is,  associated  with 

either  0  or  .  Thus,  a  set  of  oh  sc rvati ons  is  identified  if  each  ob¬ 
it  m 

servation  within  the  set  is  identified. 

Definition  2 

Information  on  a  set  of  observations  *s  said  to  be  symmetric  with 
respect  to  a  block  (or  union  of  blocks)  if  the  information  is  unaffected 
by  interchanging  the  roles  (relabeling  the  identities)  of  the  random 
vectors  yielding  the  observations  falling  within  the  block(s)  .  For 
example,  if  Vi,V?,...,V  denote  the  set  of  random  vectors  yielding  ob¬ 
servations  which  fell  within  block  B,  then  the  information,  7,  defined  on 
some  set  of  observations  which  may  or  may  not  contain  those  in  B,  is 
symmetric  with  respe-t  to  B  if  for  any  reordering  ( V  J  ,VJ[ , .  . . ,  V£)  of 
(Vi,V2,...,V  )  7  is  unchanged. 

Definition  1 

The  symbols  o  ,  o  and  o 

n  m  n+m 


will  denote  the  sets  of  observations  on 


rosp*'<-t  i  vulv  ,  where  C 


is  the.  combined  set  of  ran- 


G  ,  0  ,  and 
n  m 


n+m 


dom  vectors  {X  i  ,  X->  ,  .  .  .  ,  X  ,  Y;,...,Y  1.  Likewise,  O'.  will  denote  the  set 

n  m  j 

-k 

of  random  vec'ors  in  0  viol  dine  the  set  of  observations  o.  in  0  , 

n+m  j  n+m 

f 

which  fall  within  Mock  U  at  the  k  stage  ’  n  B* ,  for  j  -  l,2,...,k+l. 
Definition  A 


The  symbols  'A  \ are  a  relabel  ing  of  the  random  vectors 

n+m 


in  0  ,  •  Also,  the  symbols  7?.  ,7A  , . . .  ,Z^  , . .  will  represent  the  random 
n+m  ‘  1  ^  k.  f  j ) 

vectors  in  and  X‘j  ,X.|  ,  .  . .  and  *  *  *  »Yt(j)  are  tflose  ranc*°m 

vectors  in  H  and,  f*  0  resnectivel v,  defined  at  the  k*1*1  stage, 
j  n  J  m 

All  lower  case  letters  x.  v,  and  z  will  denote  corresponding  observed 


,1 


vectors . 

Stage  1 

At  the  hen i mine,  of  the  first  stage  all  information  on  o  which 

n  ttti 

is  symmetric  with  respect  to  X  (considering  the  space,  X,  as  a  non-random 

block)  is  permitted  as  well  as  all  prior  information  (i.e.  information 

available  prior  to  taking  the  observations:  previous  observations,  etc.). 

The  totality  of  this  information  is  denoted  by  J i . 

If  at  least  one  observation  in  o  ,  is  identified,  then  any  infor- 

n+m 

mation  on  o  is  not  symmetric  with  respect  to  X.  For  example,  suppose 
n+m 

z*  C  o  ,  is  identified  with  0  .  Bv  interchanging  the  roles  of  the  ran- 
n+m  n 

dom  vectors  in  G  with  those  in  0  the  information  that  z*  is  identified 
n  m 

with  0  is  no  longer  true.  Also,  any  information  on  the  set 

is  not  symmetric  with  respect  to  I  for  the  same  reason.  Trivially  then, 

since  o  and  o  are  completely  identified,  their  knowledge  is  forbidden, 
n  m 

Suppose  o  is  unidentified,  then  the  knowledge  of  the  observed 
n+m 


lit!:  i 


fi'i.j-i  «  L  tv. 


- .  line 


any  interchanging 


1 1- r. 


of  the  sal*  i>:  la..  :  ainio.1.  actors  in  <)  does  not  in  anv  way  affect 

n+m 

tiie  values  of  tin-  observed  vector:,  in  o  .  and  o  .  still  remains  un- 

n+iii  n+m 

identified.  Similarly,  anv  subset  of  o  is  unidentified  and  is  sym- 

n+m 

metric  with  respect  to  X.  furthermore,  any  function  on  any  subset  of 
observations  in  o  ,  would  he  symmetric  witfi  respect  to  X .  Since  the 

jyhni 

sample  sizes  in  and  n  would  he  considered  as  prior  information,  it  is  per¬ 
missible  to  select  two  subsets  in  o  ,  of  sizes  n  and  n  to  be  likely 

n+m 

candidates  1  or  the  sets  o  and  o  . 

n  m 

After  !  ,  has  been  established,  the  next  step  is  to  select  some 
integer  ii  •  l,2,...,n}  and  some  real-valued  measurable  function 

<}■)  (z,J  i )  which  either  liar,  a  continuous  null  distribution  or  is  chosen 
such  that  no  ties  exist  in  the  set  {<J-i  (z  ,7  j )  |  z  ft  o  } ,  if  possible. 

Through  some  independent  source* ,  having  full  knowledge  of  o  ,  the 
i  iot  largest  •■.a!  uc  in  the  set  i'<J>i(z,Ji)  |z  f.  o  }  is  determined.  This 
value  and  tlv  observed  vector  in  r>  ,  say  ,  yielding  it  are  available 


info mat  ion. 

C'  ,  ,  . 

i  ;  s  t  • 

■  =  ci,  then 

t  lu* 

function  v  1  (>; ,/ 1 )  =  c\ 

is  used  to  d: 

!  Y  i  « I  • 

•  the* 

sample  space  .V  into 

t  VO 

open  rop.ions: 

...  ‘  i  (>:,  .'i  >  •  c.]} 

and 

hi  -  '  v-  ■  A" !  d  i  ( x ,  7  i )  >  ci)  . 

l  l 

The  tv  v  ir.  term  at  ion.  that  is  permitted  after  tin-  blocks  B,  and  B2 

have  1  > .  .  • :  :  r:.;-  ! ,  -on  its  of  <•  j ,  x;j;  and  the  two  subsets  of  observations 

o'  and  i '  rout  :ii.rd  in  n  .  Add  it  ional  information  permitted  at  the  end 
1  •  u+  in 

*  Jn  this  cue.  .  an  independent  source  could  bo  an  assistant  or  a 
<1  j  g  i  t  a  1  oo;  put  ■  r . 
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ol  stage  .!  also  .includes  all  information  on  u,,  o_  ,  and  on  o  y 

1  ?  n-Hn 

(including  xp  which  is  symmetric  with  respect  to  both  Bj  and  B2 .  That 
is,  any  information  which  remains  unchanged  when  the  roles  of  the  random 
vectors  in  0]  are  interchanged  and  also  is  unchanged  whenever  the  roles 

l 

of  the  random  vectors  in  9,  are  interchanged.  This  type  of  information 
will  be  defined  as  information  which  is  "symmetric  separately"  with  re¬ 
spect  to  a  set  of  blocks.  Now,  if  ij  =  1,  the  block  b|  cannot  contain 
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any  observation  in  o  ,  hence  o]  is  completely  identified  (although  o, 
is  unidentified  if  n  >  1).  Similarly,  if  ij  =  n  then  o2  is  completely 
idevtifled  and  o.  is  unidentified  for  n  >  1.  Whenever  ii  j*  1,  n  (n  >  2), 
then  both  o.  and  o  are  unidentified.  Information  of  this  type  may  be 
used  advantageously  m  choosing  the  shape  and  possibly  the  location  of 
'■  he  r-'-t  rev.-  blocks  (see  Chapter  V).  By  definition  of  B  j ,  B*  and  cj,  it 

is  easy  to  deduce  that  there  are  exactly  ii~l  and  n-ii  observations  in 

-  !  1  - !  -1 

o^  lying  witnin  B,  and  J?2 ,  respectively.  Since  o,  and  o2  are  permitted 

th--  ‘  lie  exact  number  of  observations  in  o  falling  within  B,  and  B„  can 

m  12 

ho  c  -li.  mi  m  . 

Stage  2 

After  cons  i  deri  nr,  all  permissible  information  available  at  the  end 

of  strge  1,  the  next  step  is  to  select  cither  block  B j  or  block  k*  for 

division  at  the  second  stage  level.  The  particular  block  chosen  must 

contain  at  least  one  observation  in  o  .  In  addition,  the  other  block 

n 

not  chosen  for  stage  2  division  must  be  considered.  One  of  two  actions 
are  required:  (1)  the  block  in  reserved  for  potential  division  at  some 
later  stage  (it  rust  contain  at  least  one  observation  in  oj  or  (2)  the 


bluvA  w  i  *  *  .  .. :  ;  •>  '  ‘  jovct-ss .  The  i  irst  act  ion  does  not 

necessarily  imply  that  the  him!,  will  eventually  lx;  divided  but  that  it 
may  be  considered  for  division  at  some  future  stni’.e.  Tf  the  second  ac¬ 
tion  were  taken,  it  i s  permissible  t o  identify  all  observations  falling 

within;  although,  some  of  these  observations  ntav  be  in  o  .  This  action 

n 

could  provide  considerable  information  especially  if  there  were  only  a 

feu  observations  in  o  and  mnnv  observations  in  o  lving  within.  Then 

n  m 

a  better  choice  of  candidate  sets  could  be  made. 

The  next  step  is  to  determine  the  information  T 2  that  can  be  used 
to  select  the  next  cutting  function,  (^(z, ^2)-  The  type  of  information 
permitted  in  I j  depends  upon  the  action  taken  above.  If  both  blocks 
were  considered  for  division,  one  at  the  second  stage  level  and  the  other 
at  some  future  stage,  then  1 2  is  defined  to  consist  of  all  previous  in¬ 
formation  and  all  informat  ion  on  oj ,  o2  and  o^^  which  is  symmetric 

1  > 

separately  with  respect  to  both  and  B^.  That  is, the  information  must 
be  symmetric  with  respect  to  b|  and  also  symmetric  with  respect  to  . 
Consider  the  unidentified  sets  o'  and  o'.  Since  all  observations  in  oj 
and  all  observations  in  o2  are  unidentified,  then  «..iy  information  on  Oj 
is  symmetric  with  respect  to  Bj  and  is  trivially  symmetric  with  respect 

to  Bz,  similarly  any  information  on  o2  is  symmetric:  separately  with  -re- 

11  _i  _1 

spect  to  Bj  and  B?.  Then  any  information  on  any  subsets  of  Oj  and  o? 

is  symmetric  separately  with  respect  to  It*  and  B^.  Trivially  x*  is 

symmetric  with  respect  to  both  blocks.  Therefore,  any  information  on  any 

subset  of  observation"  in  o  in  permitted. 

nl-rn  1 

If  the  second  action  above'  were  selected,  then  T ;■  is  defined  to 

-l-i 

contain  all  previous  information  and  all  information  o, ,  o?  and  o 
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whirh  is  ■ 

tor  di:..  .. 


to  only  tl  i’  Mock  chosen  for  division. 


. !  v.  ci.  :,j  were  chosen  for  division  and  it  was 


1  -l 

decided  that  Mock  B,  would  never  lx-  divided.  Then  the  set  o2  can  be 

-l  -1 

completely  identified  and  any  information  on  o?  or  any  subset  of  o2 


is  clearly  symmetric  with  respect  to  B  .  Also,  in  line  with  the  above 


-l 


discussion,  any  information  on  the  unidentified  set  o  or  on  any  subset 


-l 


of  Oj  or  on  x ,  is  symmetric  with  respect  to  B ^ .  It  follows  then,  any 


-l 


information  on  nnv  subset  in  o  ,  given  that  o,  is  unidentified  is 

n+m  1 


symmetric  -irh  resnoct  to  H  t . 


In  summary,  .  contains  all  previous  information  and  all  informa- 


tion  on  any 

r>uhr. 
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function  c‘)?(z,7?)  such  that,  either  it  has  a 


ihle  information.  The  cutting  function  (t>2(x,/2)  =  c2 


2  ,  i  ,  . 

i’>  =  ;  >:  ■  R  ,  .  ,■  ( ,  /  ->  >  <  C2  } 


i  (x,  J 2)  <  cp  ,  <ci(x,7i)  <  ci) 


and 


fx  t  1!  J  !  7;  (x  ,  7  ?  ) 


-  (x  1  .V  j  2  ( x ,  7 ■> )  >  c?  ,  4>i(x,7i)  ci) 


l'o  standard! ..c  ih  r 


Then  the  three  blocks 


.A't.iL  i on  let  B3  —  ^2" 

2  2  2 

defined  at  the  second  stare  are  B ( , B ? ,  and  B3. 

If  B*  were  selected  for  division  at  stage  2,  then  i2  would  be 

selected  from  the  integers  { 1  , 2  , .  . .  ,n-i  i  }  and  an  appropriate  4)2(z,72) 

function  would  be  chosen.  The  vector  x*  and  the  value  C2  would  be  such 

that  <f>2(x*,/2)  =  C2  is  the  12°^  largest  value  in  the  set  f<|>2 (x,72)  (x  C 

o'rio  }.  The  blocks  formed  would  be 
2  n 

B?  =  fx  r.  B*  |<f>2  (x,l2>  <  c2}  , 

B  3  =  fx  C  B., ,j  <t>2  (x,  1 2)  >  c2  1  , 


and 


The  information  that  is  available  for  entering  the  third  stage  (if 
desired)  constitutes  1 2,  x*,  C2 ,  the  observation  sets  o^,  o\ ,  and  o*, 
and  all  information  on  which  is  symmetric  separately  with  respect 

to  all  the  blocks  whose  corresponding  observation  sets  are  not  identified. 
By  the  above  definition,  1 2  contains  1 1,  Xj  ,  ci,  and  all  other  information 
available  at  the  begirning  of  the  second  stage. 

Stage  r  (r  n_) 

At  the  beginning  of  the  stage  the  totality  of  permissible  in¬ 
formation  consists  of  i  , ,  x*  , ,  c  , ,  the  blocks  bT  \  B2  k 

r-1  r-1  r-1  r 

and  the  corresponding  observation  sets  o^  ^,0^  ^  some  of  which 

may  be  completely  identified. 

Within  the  set  of  blocks,  that  have  not  prcviousl>  been  designated 
as  "blocks  never  to  be  divided",  one  block  is  selected  for  division  at 
the  r^1  stage.  Again,  this  blork  must  contain  at  least  one  unidentified 


F 
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observaticp  in 


i  liu  roi.Mini.iiR  blocks  must  be  classified 


cither  ns  n  block  consid.eied  for  future  division  or  as  a  block  never  to 
be  divided.  None  of  the  blocks  B^  \****B^  *  which  has  been  cate¬ 

gorized  as  a  block  never  to  be  divided  can  at  any  stage  be  reclassified 
as  a  block  considered  for  future  division. 

The  information  I  is  defined  to  consist  of  all  previous  informa¬ 
tion  and  all  information  on  o  ,  or  on  any  subset  in  o  ,  which  is  sym- 

n+m  J  n+m  1 

metric  separately  with  respect  to  all  blocks  which  could  be  chosen  for 

division  either  at  the  r*^1  stage  or  at  some  future  stage. 
r-1 

If  block  B  (for  some  j  =  1,2,..., r)  were  chosen  for  division 

at  the  stage  and  the  number  of  observations  in  o^  lying  within  ^ 

is  e.,  then,  using  J  .  select  an  integer  i  in  (l,2 . e.}  and  a  real- 

1  J 

valued  measurable  function  <f>r(z ,1  )  such  that  either  it  has  a  continuous 

null  distribution  or  there  are  no  ties  within  the  set  (<J>  (z,I  )|zeoy  h. 

r  r  1  j 

Through  an  independent  source,  the  vector  x*  e  o  and  the  value  d>  (x*,J  ) 

r  n  r  r  r 

th 

=  c  are  provided  where  c  is  the  i  largest  value  in  the  set 
r  r  r 

4  (x,i  ) lx  c  o.  n  3  }.  The  cutting  function  d>  (x,I  )  =  c  is  used  to 
r  r  1  j  n  a  r  r  r 

define  two  new  blocks  in  this  block  Br  : 

J 

Br  =  [x  C  Br-1|4>  (x,7  )  <  c  } 

J  J  1  r  r'  r 

and  , 

=  {x  c  bT  4  (x,I  )  >  c  } 
j+1  .1  1  r  r  r 

r—3  r-1  r  “  1 

The  remaining  blocks  in  {Bj  ,B2  ,...,B  }  are  relabeled  as 


Br  -  Br_1 
1  1 


for  i  =  1,2, . .  . ,  j-1 


Br  =  Br  1 
l+l  l 


for  i  =  j+1 , . . . , r 


The  total  permissible  information  available  for  entering  the  (r+1) 
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r  r  r 

stage  consists  ot  j x^,  c^,  the  blocks  B t ,B2 , . . . ,B  the  correspond- 

—  r  —  r  —  x 

ing  observation  sets  o . , o, , .  . . ,o  ,  and  all  information  which  is  sym¬ 
metric  separately  with  respect  to  the  set  of  all  blocks  in  {b^,B2,..., 
B^_^}  having  corresponding  observation  sets  which  have  not  been  identi¬ 
fied. 

The  process  H*  may  be  continued  through  the  n*^  stage  if  all  blocks 
designated  ''never  to  be  divided"  contain  no  observations  in  o^.  Or  the 
process  may  be  stopped  at  any  stage  level  if  it  has  been  decided  that  a 
sufficient  number  of  blocks  have  been  obtained  to  properly  evaluate  the 
two-sample  tolerance  test  statistic  considered.  However,  the  test,  pre¬ 
viously  selected,  may  dictate  the  number  of  blocks  to  be  formed  and  pos¬ 
sibly  the  number  of  observations  in  o  v’hich  must,  lie  within  each  block 
formed.  Most:  fwo-snmn'Jo  tolerance  tests  are  analogs  of  two-sample  uni¬ 
variate  rank  tests  and  would  possiblv  require  the  process  to  continue 
through  the  n^'  •r'-ige,  if  it  were  not  apparent  at  some  earlier  stage 

that  all  observations  in  o  have  been  identified. 

m 

If  tlie  process  B-  were  permitted  to  continue  through  the  stage 

all  blocks  formed  ll’1  ,B?, .  .  .  ,Bn . ,  are  called  basic  blocks  and  are  equiva- 

lent  (in  ttv  probabilitv  sense  defined  in  Chapter  III)  to  statistically 

equivalent  blocks  formed  bv  the  standard  one  sample  block  construction 

process  outlined  in  Chapter  TTT. 

Finally,  it  should  be  noted  that  the  standard  one-sample  block 

construction  process  is  a  special  case  of  H*.  J.et  both  sets  of  random 

vectors  0  and  ■'  denote  independent  random  samples  which  under  the  null 
n  i;> 

hypothesis  have  the  same  distribution  function,  F(x).  Then  the  joint 
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null  distribution  uf  '  :  •  rvmmetric.  At  each  stage  r,  r  =  1,2, .  . .  ,n 

n+m  ’  *  ’ 

in  B*  let  the  information  I  contain  only  the  observed  vectors  x*,x*  .... 

1  7.  9  7 

x*  and  ci,C2»»**»c  ^  and  all  prior  information  available  before  ob¬ 

taining  the  observations  within  the  camples.  Then  this  restricted  ver¬ 
sion  of  B *  is  identical  with  the  standard  one-sample  process. 


CHAPTER  V 


PRACTICAL  CONSIDERATIONS 

Several  practical  techniques  are  suggested  for  effectively  applying 
the  proposed  method  to  forming  multivariate  two-sample  tolerance  tests. 

Also  included  are  special  considerations  when  using  univariate  two-sample 
tolerance  tests,  a  suggested  operational  procedure,  and  a  discussion  on 
the  potential  problem  of  bias  associated  with  the  quality  control  tests 
presented  in  the  second  chapter.  The  terminology  and  notation  defined  in 
Chapters  II  and  IV  are  used  in  this  discussion.  The  dimensionality  of 
all  data  vectors  will  be  denoted  by  p. 

A  suggested  preliminary  procedure  that  should  be  considered  before 
applying  the  B*  process  (i.e.  before  looking  at  the  data)  begins  by  either 
selecting  an  appropriate  univariate  two-sample  tolerance  test  or  develop¬ 
ing  a  multivariate  test  which  apparently  best  applied  to  the  given  problem. 
The  next  step  is  to  determine  the  number  and  basic  block  contents  of  th" 
tolerance  regions  to  be  formed.  These  values  are  defined  directly  by  the 
test  selected  for  use.  The  third  important  step  requires  a  specific  des¬ 
cription  of  a  construction  plan  showing  the  general  order  or  layout  of 
the  blocks  to  be  formed  at  various  stages.  This  construction  plan  is  not 
intended  to  dictate  the  shapes  of  the  desired  tolerance  regions  but  merely 
to  state  a  means  for  identifying  each  particular  desired  region  once  they 
all  have  been  formed  by  the  B*  process.  In  other  words,  it  is  forbidden 
to  first  construct  a  set  of  blocks  then  decide  on  how  the  desired  toler- 


n 

ante  regions  will  be  i^,.LiUi.u  and/or  perhaps  [onned  by  a  combination 
of  blocks  (e.g.  basic  blocks).  The  final  step,  which  appears  to  only  be 
required  by  certain  tests,  is  to  associate  each  desired  region,  identified 
by  the  construction  plan,  with  a  unique  frequency  count  statistic.  That 
is,  certain  tests  may  require  that  the  desired  regions  be  preassigned  a 
fixed  order.  These  preliminary  considerations  are  further  discussed  for 
specific  tests  given  as  examples  in  this  chapter. 

Once  a  tolerance  test  has  been  selected  the  proposed  operational 
objective  in  us ing  the  B *  process  is  to  form  the  desired  blocks  accord¬ 
ing  to  the  construction  plan  so  as  to  make  the  test  statistic  as  signifi¬ 
cant  as  possible.  Thus,  it  would  appear  that  this  objective  can  best  be 
satisfied  by  visually  considering  the  set  of  unidentified  observed  vec¬ 
tors.  However,  if  the  dimensionality,  p,  of  the  data  vectors  is  large 
an  actual  ”look-at"  the  data  situation  may  prove  to  be  impractical  as  well 
as  confusing.  To  alleviate  this  problem  the  principal  component  tech¬ 
nique  is  suggested.  This  technique  will  usually  permit  the  analyst  to 
consider  only  a  two-dimensional  plot  of  transformed  data. 

The  statistical  method  of  obtaining  principal  components  C[2], 
Chapter  II)  can  be  used  as  a  numerical  technique  for  transforming  the 
original  coordinate  system  orthogonally  onto  another  p-dimensional  co¬ 
ordinate  system.  This  new  coordinate  system  is  constructed  by  choosing 
the  first  coordinate  to  have  maximum  dispersion  among  the  transformed 
data  vectors.  The  second  coordinate,  orthogonal  to  the  first  is  chosen 
to  have  the  next  largest  dispersion  among  the  transformed  data,  etc. 


*  )  ~ 


Technically,  tin-  I  i  r.-.i  u-.*  on.iii.tLi'  ).■>  de!  .ne«l  i.v  uti  eigenvector 
associated  with  the  largest  eigenvalue  of  the  scatter  matrix  determined 
from  the  set  of  observed  vectors.  Then  the  second  coordinate  is  defined 
by  an  eigenvector,  orthogonal  to  the  first  eigenvector,  associated  with 
the  second  largest  eigenvalue  of  the  scatter  matrix,  etc.  Thus,  the 
first  two  coordinates  formed  by  the  principal  component  technique  des¬ 
cribe  the  greatest  amount  of  dispersion  among  the  transformed  data.  This 
should  be  a  convenient  and  valuable  aid  for  selecting  candidate  sets  and 
cutting  functi  'ns. 

Other  numerical  or  statistical  techniques  can  also  be  used  for  this 
purpose.  For  example,  the  statistical  method  for  determining  canonical 
correlations  (  \  ?.  ],  Chapter  121  also  provide  a  now  coordinate  system. 
Actually,  any  continuous  transformation  on  the  original  p-dimensional 
space  could  be  consi derod. 

The  propose  J  proof  i  of  reducing  the  dimensionality  of  the  multi¬ 
variate  situation  by  means  of  var.ous  transformation  schemes  offers  a 
promising  approach  for  considering  the  data.  However,  the  data  charac¬ 
teristics  may  also  be  studied  bv  actually  increasing  the  dimensionality 
of  the  problem.  For  example,  considering  the  mean  vectors  and  variance- 
covariance  matrices  of  various  subsets  of  the  totality  of  unidentified 
observations  could  provide  invaluable  information  for  selecting  the  can¬ 
didate  sets  and  appropriate  cutting  functions.  Also  any  other  numerical 
methods  can  be  used  to  analyze  the  data  situation.  This  extended  freedom 
can  bo  used  to  further  describe  the  general: tv  of  the  permissible  infor¬ 
mation  on  the  multivariate  observations  defined  hv  the  P*  process. 


In  order  Le  clnriiy  cue  use  of  the  proposed  method  of  forming 

multivariate  two-sample  tolerance  tests,  four  examples  are  provided. 

A  different  test  is  considered  in  each  example.  Tn  these  examples  the 

observations  on  0  will  be  used  to  establish  the  block  frequency  counts 
m 

determined  by  the  blocks  formed  on  the  observations  on  0  .  All  schematic 
drawings  used  to  display  the  data  situations  assume  that  either  p  =  2  or 
the  data  has  been  transformed  so  that  a  two-dimensional  space  suffices. 

As  the  first  example,  suppose  m  =  1  and  n  >  1.  A  tw^-sample  toler¬ 
ance  test  can  he  developed  by  establishing  one  tolerance  region  (block) 

containing  most  of  the  observations  on  0  .  If  the  one  ob'ervation  on  0 
c  n  m 

(new  observation)  falls  outside  this  region  the  null  hypothesis  is  re¬ 
jected,  otherwise  it  is  not  rejected. 

The  basic  -.lock  content  of  this  region,  say  n  +  1  -  v,  depends  on 
the  sign! f 5 c.mcc  level  chosen  for  the  test:.  The  exact  significance 
level  of  this  test  is  the  null  probability  that  the  new  observation  falls 
outside  the  desired  tolerance  region.  This  probability  is  computed  to  be 
v/(n  +1).  If  ct  denotes  the  chosen  significance  level  of  the  test,  then 
the  value  of  v  is  determined  to  be  the  smallest  integer  such  that 

v  >_  a(n  +  1) . 

After  v  has  been  determined,  the  objective  of  the  proposed  method  is  to 

construct  according  to  some  plan  a  tolerance  region  of  content  n  +  1  -  v 

on  the  observations  on  ft  which  apparently  best  emphasizes  any  difference 

between  the  ret  of  observations  on  0  and  the  new  observation.  This 

n 

desired  region  can  be  constructed  on  one  or  in  as  many  ns  n  stages  using 


the  B *  process. 

If  it  were  considered  to  use  only  one  stage  to  construct  this 


u.- 

region  then  the  two  blocks  termed  must  he  o*  content  v  and  n  +  1  -  v, 
respectively.  In  this  case,  the  block  of  content  v  would  actually  be 
the  critical  region  of  the  test.  The  permissible  information  available 
to  form  this  first  stage  cut  would  include  all  information  on  the  com¬ 
bined  observations  which  is  symmetric  with  respect  to  0^^  and  all  inde¬ 
pendent  information  available  prior  to  taking  the  observations.  However, 
this  approach  to  forming  the  desired  tolerance  region  would  not  exploit 
all  the  advantages  of  the  B*  process. 

A  suggested  approach,  which  apparently  makes  better  use  of  the  B* 
process,  requires  that  all  n  +  1  basic  blocks  be  formed  in  a  particular 
way.  The  objective  of  this  approach  is  to  form  the  shape  of  the  desired 
tolerance  region  (also  determines  the  shape  of  the  critical  region  of  this 
test)  which  best  defines  the  difference  between  the  new  observation  and 
th«  observations  on  0  .  This  objective  may  be  accomplished  in  the  follow¬ 
ing  manner.  For  the  first  stage,  select  one  unidentified  vector  as  the 
candidate  for  the  new*  observation.  This  candidate  may  be  selected  as  the 
vector  which  lies  the  "furthest"  away  from  the  remaining  n  vectors. 

Using  these  n  remaining  vectors  for  the  observations  on  0  ,  determine  a 
real  valued  function  which  for  some  value  in  its  range  separates  the  new 
observation  candidate  from  the  remaining  n  vectors  by  enclosing  the  n 
vectors.  Set  i\  =  1  (if  the  value  of  the  function  is  directly  propor¬ 
tional  to  its  enclosed  volume)  and  obtain  the  first  stage  cut  using  this 
function.  This  cut  will  define  a  basic  block  which  should  be  located 
somewhat  near  the  center  of  the  empirical  distribution  of  the  observations 
on  0  .  Maintaining  the  same  objective  used  in  the  first  stage  the  process 
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may  b« •  .  ont  inm-.:  tn  t  >  i  in-  t  \i  -.taRO.  Of  course,  at  cacti  snca-i.ul:^ 

st  ape  a  new  ohse  r-vtion  r  md  f  d  <to  an  d  functions  nay  hr  chosen  differently. 
However,  if  at  any  stage  the  new  basic  block,  formed  contains  an  observa¬ 
tion,  this  observation  must  he  the  new  observation  and  the  null  hypothesis 
cannot  be  rejected  concluding  the  test.  If  the  new  observation  has  not 
been  identified  by  the  end  of  the  (n-v)1^  stage  it  must  lie  within  the  re¬ 
maining  tolerance  region  of  content  v  +  1  and  the  process  must  continue. 

Throughout  these  n-v  stages  the  objective  is  to  select  cutting 
functions  so  as  to  exclude  the  new  observation  candidate  from  the  basic 
blocks  formed.  Since  more  information  is  provided  by  the  B-'  process  at 
each  successive  stage,  the  cutting  function  defined  at  the  (n-v) ^  stage 
should  reasonably  well  represent'  the  shape  of  the  empirical  distribution 
defined  by  the  observations  on  (1  .  A  schematic  drawing  showing  the 
general  appearances  of  the  n-v  cutting  function  is  given  in  Figure  la. 

Note  at  this  point  the  (n  +  1  -  v)1"*1  cutting  function  which  will  define 
the  desired  tolerance  region  has  not  been  formed.  "it  -raid  be  formed 
in  the  next  stage;  however,  more  information  pertaining  to  the  "best" 
shape  of  this  region  can  be  obtained  bv  forming  v  -  1  more  basic  blocks. 

In  the  next  stage  the  integer  i  ,,  is  set  equal  to  v  and  the  cut- 
ting  function  is  chosen  in  a  similar  way  used  in  obtaining  the  cutting 
functions  in  the  previous  n-v  stages  hut  with  the  new  objective  to  in¬ 
clude  the  new  observation  candidate  within  the  basic  Mock  formed.  The 

basic  block  formed  at  this  stage  includes  all  points  in  the  sample  space 
lying  outside  the-  region  enclosed  by  the  cut.  The  same  objective  is  used 

to  construct  the  next  v  -  2  basic  blocks.  I f  at  any  stage  an  observation 


1 


lies  within  the  basic  hi:  .  f ormo.l ,  this  observation  must  be  the  new 
observation  and  the  null  hypothesis  is  rejected  concluding  the  test. 

If  after  these  n-1  stages  the  new  observation  has  not  been  identi¬ 
fied,  the  remaining  region  (basic  block  content  is  two)  must  contain 
exactly  two  vectors.  One  of  these  unidentified  vectors  is  some  observa¬ 
tion  on  ^  and  the  other  vector  must  be  the  new  observation.  A  schematic 
n 

of  this  situation  is  given  in  Figure  lb  showing  the  remaining  region.  At 
s  t 

the  end  of  the  (n-1)  stage  two  reasonably  well  shaped  cutting  functions 
defining  the  remaining  region  can  then  be  used  advantageously  to  define 
the  shape  of  the  final  cut. 

This  latter  approach  may  not  seem  too  practical  especially  if  n 
were  very  large.  A  similar  method  could  be  used  in  which  blocks  of 
content  two  or  more  arc  considered  at  each  stage.  This  method  would  re¬ 
quire  fewer  stages  of  the  B*  process  and  would  yield  a  favorable  toler¬ 
ance  region.  However,  regardless  of  the  approach  used  it  must  be  decided 
(before  applying  the.  F*  process)  how  the  desired  tolerance  regions  will 
eventually  be  defined  and  how  they  occur  in  the  test  statistic. 

Tests  of  this  type  have  direct  application  in  the  quality  control 
tests  (Chapter  II).  For  example,  suppose  the  first  data  sets  contain  n 
observations  and  all  other  data  sets  consist  of  exactly  one  observation 

each.  If  it  were  decided  that  for  each  subtest  the  critical  region  would 

th 

be  of  basic  block  content  v,  then  the  exact  significance  level  of  the  j 
subtest  would  be  v/(n+j)  if  no  randomization  were  used  in  all  subtests. 
The  significance  level  of  the  overall  test  would  in  this  case  be 


k 

-  II  (1 

j  =  l 


ir+~"j)  - 1  -  0/<n  v  k> 


d 


a  =  1 
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whore  k  denotes  the  n.i,-  i mum  mi-iher  of  subtests .  From  this  expression, 
the  vilue  either  k,  a,  n,  or  v  may  be  determined  if  the  other  values 
are  specified.  For  example,  if  v  -  1  for  all  subtests,  then  the  value  of 
k  would  be  determined  as  the  smallest  integer  satisfying 


Other  similar  quality  control  tests  could  be  established  using  two-sample 
tests  of  this  type. 

As  the  next  example,  suppose  that  the  Wilcoxon-  Mann  -  Whitney 
test  [10]  has  been  selected  for  application.  The  statistic  associated 
with  this  test  is 

m 

II  =  Y.  r.  -  m(n  +  m  +  11/2 
i=l  1 

where  r.,  i  =  1,2,... ,m  are  the  "ranks"  of  the  observations  on  0  .  Some 
l  m 

properties  cf  this  test,  considered  for  the  univariate  case,  are  presented 
in  reference  [14]  on  pages  61  through  68.  The  null  distribution  of  this 
two-sample  test  for  certain  ranges  of  n  and  m  arc  tabulated  in  reference 
[  4  ].  For  rather  large  n  and  m  the  null  distribution  of  U  can  be  approxi¬ 
mated  by  the  normal  distribution. 

In  the  univariate,  case,  this  test  best  emphasizes  alternative  hy¬ 
potheses  inferring  slippage  in  the  location  between  the  two  populations. 
That  is,  it  will  detect  with  rather  high  efficiency  whether  one  popula¬ 
tion  is  statistically  less  than  (or  greater  than)  the  other  population. 
This  test,  however,  is  not  efficient  for  testing  a  difference  in  disper¬ 
sions  if  the  two  populations  have  nearly  the  same  location. 

To  study  the  multivariate  analogy  of  this  test  (or  any  other  test) 
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lt  is  highly  recommended  that  the  statistic  be  expressed  in  terms  of  the 

appropriate  block  frequency  counts.  In  the  case  of  the  Wilcoxon-Mann- 

Whitney  test,  the  "ranks"  of  the  observations  on  0  can  only  be  det -r- 

mined  by  basic  blocks.  Thus,  the  U  statistic  should  be  rewritten  in 

terms  of  mi  ,tn2  ,  •  •  •  ,m  ^  ,  the  basic  block  frequency  counts. 

First  note  that  the  "ranks"  of  the  observations  on  0  are 

n 

n 

mi  4-  1,  mi  +  m2  +  2,...,  T,  m.  +  n. 

i=l  1 

Then  the  "r  inks"  of  the  observations  on  fl  vrould  be  the  remaining  inte¬ 
gers  in  the  set  {l ,2 , . . . ,n+m} .  Therefore,  the  sum  of  the  ranks  r^  of  the 
observations  on  t?  is  equivalent  to  the  difference  between  the  sum  of  all 

ranks  in  the  combined  set  of  observations  on  0  .  and  the  sum  of  the  ranks 

r>-*-m 

of  the  observations  on  0  .  This  gives 

n 


n+r,i 


>:  r. 
i=l  1 


-  j  -  T-  {  T.  mi  +  j} 


j=l  i=l 


n+1 


=  m(m  -  »  -  l)/2  +  nm/2  +  X  j  m. 


i=l 


n 

since  m  -  m  =  T.  m.. 

n+1  .  ,  j 

J  =  1 


Thus  the  I'-stnt isti c.  expressed  in  terms  of  the  basic  block  fre¬ 
quency  counts  simply  hr comes 
n+1 


U 


m(n+2) 


j  =  l 


1  -  2 


This  implies  that  the  value  of  l'  depends  not  only  on  the  basic 


block  frequency  counts  but  also  on  tbc  way  in  which  the  blocks  arc  ordered. 


To  appropriately  eat-  the  Wi i coxon-Mann-Whitnr/  fust  as  z  multivariate 
two-sample  tolerance  test,  a  method  of  ordering  the  basic  blocks  to  be 
determined  by  the  B*  process  must  be  prespecified.  This  additional 
consideration  could  be  resolved  by  specifying  the  general  manner  in  which 
the  basic  blocks  are  to  be  formed. 

Another  important  consideration  is  the  interpretation  of  the  U- 
statistic  when  defined  on  some  set  of  pre-ordered  basic  blocks.  In  the 
univariate  case  the  basic  blocks  are  determined  by  the  order  statistics 
r*d  are  ordered  in  the  natural  way.  This  ordering  provides  the  basis  on 
which  the  U-statistic  was  originally  interpretea.  That  is,  if  the  U- 
statistic  obtained  a  value  near  either  its  lowest  or  highest  possible 
values,  then  this  would  be  interpreted  correctly  to  mean  that  the  simple 
two-sample  null  hypothesis  was  probably  not  true.  However,  in  the  multi¬ 
variate  case  the  interpretation  of  the  U-statistic  would  depend  largely 
on  the  ordering  and  relative  locations  of  the  basic  blocks.  If  the  basic 
blocks  were  ordered  in  any  haphazard  way,  then  any  logical  interpreta¬ 
tion  of  various  values  of  the  U-statistic  would  be  difficult  to  express. 

If  it  were  desired  to  interpret  the  U-statistic  in  the  same  concept 
used  in  the  univariate  case,  two  situations  must  be  considered. 

First,  suppose  that  a  two-sided  U-test  were  selected.  In  the  multi¬ 
variate  case  the  alternative  hypotheses  to  be  emphasized  should  reflect 
that  the  two  populations  differ  in  location  in  some  direction  in  the  p- 
dimensional  space.  The  following  is  a  suggested  procedure  for  establish¬ 
ing  and  ordering  the  basic  blocks  for  this  two-sided  U-test. 

(1)  Using  some  numerical  procedure  (e.g.  least  squares)  determine 
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Che  best  tiL  ot  trie  combinco  unidentified  observed  vectors  to  a  straight 
line.  This  line  can  be.  assumed  to  represent  the  "most-likely"  direction 
of  any  location  difference  between  the  two  populations. 

•;2)  V  first  stage  ce,_  is  made  by  a  hyperplane  orthogonal  to  the 
line  (established  in  (1))  passing  through  the  (a)  median  (with  respect 
to  the  hyperplanc)  of  the  observations  on  0  ,  that  is,  ii  is  selected  to 
be  an  integer  nearest  to  (^~) . 

(3)  The  two  blocks  defined  ->t  the  first  stage  are  then  divided 
into  basic  blocks  hv  forming  a  series  of  blocks  radiating  out  from  this 
hyperplane.  This  can  be  accomplished  by  choosing  cutting  functions 
nearest  the.  center  of  the  empirical  distribution  of  the  candidate  vectors 
for  the  observations  on  0^  in  each  subsequent  stage  (see  Figure  2a) . 

This  construction  approach  insures  that  the  cutting  functions  will  des¬ 
cribe  the  shapes  of  the  empirical  probability  surfaces  of  the  observa¬ 
tions  on  0 in  the  "tails".  The  order  established  by  the  B*  process 
given  by  the  subscripts  in  the  set  {B^.B^ , . . . ,B™+^}  would  be  a  natural 
basic  block  ordering  suitable  for  the  desired  interpretation  of  the  13- 
statistic. 

Now  suppose  that  a  one-sided  U-test  were  desired.  The  alternative 
hypotheses  associated  with  this  test  in  tlie  univariate  case  would  reflect 
one  population  is  stochastically  larger  (or  smaller)  than  the  other  popu¬ 
lation,  In  the  multivariate  case  the  alternative  hypotheses  could  be 
either  that  one  population  is  stochastically  larger  (or  smaller)  than  the 
other  in  some  direction  or  that  the  two  populations  differ  in  location 
where  the  direction  is  not  specified.  To  test  the  first  form  of  the  al- 


temative  hypothesis  the  same  procedure  outlined  above  for  the  two-sided 
test  could  be  used.  To  test  the  second  form  of  the  alternative  hypothe¬ 
sis  another  method  is  suggested  (see  Figure  2b). 

(1)  Determine  the  mean  vector  of  the  candidate  vectors  that  are 

associated  with  the  observations  on  0  .  If  the  candidate  sets  are  dif- 

n 

ficult  to  determine  the  mean  vector  of  the  totality  of  observed  vectors 
can  be  used. 

(21  Tn  the  first  stage  take  ii  =  1  and  make  the  first  cut  centered 
jjout  the  mean  vector.  The  shape  of  the  cut  should  perhaps  be  determined 
ay  the  empirical  distribution  established  on  the  candidate  set  for  obser¬ 
ve''  ic-s  or  '  - 
r 

(3)  kedefine  (if  necessary)  the  candidate  set  for  the  observations 

on  ?  based  on  the  information  made  available  by  the  first  stage.  Then 

v. 

reaiT:  f.  !,'•  and  (2)  for  i2  =  1. 

til 

(4)  Continue  the  procedure  to  the  n  stage.  The  resulting  order 
of  t'ae  basic  Mocks  defined  by  the  /?*  process  can  then  be  used. 

F  te:  This  last  procedure  does  not  necessarily  result  in  concentric 
cutting  functions  but  the  cutting  functions  will  tend  to  radiate  out  from 

the  center  of  the  empirical  distribution  of  the  observations  on  0  . 

n 

/-•■'•in,  the  shapes  of  the  cutting  functions  determined  in  the  latter 

.  .:ges  of  the  B*  process  closely  describe  the  shape  of  the  empirical 

distribution  of  the  observations  on  0  .  That  is,  the  differences  between 

n 

the  observations  in  the  "tails"  of  this  empirical  distribution  may  be 
emphasized  by  this  approach. 

Another  approach,  which  appears  to  be  better  for  forming  these 


basic  blocks  for  the  one-sided  U-test  and  may  be  considered  for  other 
tests,  seems  to  contradict  one's  natural  intuition.  This  approach  is  as 
follows : 

(1)  Using  the  totality  of  unidentified  observations  determine  the 
new  coordinate  system  of  principal  components. 

(2)  For  the  first  stage  obtain  an  acceptable  real-valued  function 
on  this  new  coordinate  system  which  apparently  best  describes  the  general 
contour  of  the  empirical  distribution  defined  by  the  candidate  set  for 
the.  observations  on  <3^  (or  if  not  possible,  on  the  totality  of  observa¬ 
tion)  . 

(3)  Rotate  this  function  through  its  center  by  making  appropriate 

transformations  and  i nterchanging  the  roles  of  the  i'  principal  componen 
s  t 

with  the  (p  -  i  +1)  principal  component  for  i  =  1,2, . . . , [p/2]  ([x] 
denotes  Lhc  largest  integer  less  than  or  equal  to  x) . 

(4)  Set  i. i  =  1  and  determine  the  first  cut  using  this  rotated 
function. 

(5)  Repeat  steps  (2)  through  (4)  setting  i^  =  1  for  j  =  2,3,...,n 
and  possibly  redefining  new  functions  and  candidate  sets  at  each  stage. 

A  schematic  picture  of  this  approach  is  given  in  two-dimensions  in 
Figure  3. 

Since,  the  first  principal  component  contains  the  greatest  amount  of 
dispersion  among  the  transformed  observations,  this  component  axis  may 
represent  the  most  likely  direction  showing  any  differences  in  location 
between  the  two  populations.  The  second  principal  component  axis  indi¬ 
cates  the  next  most  likely  direction  of  location  differences,  etc.  The 


objective  of  this  approach  i:.  to  shape  the  blocks  in  such  a  way  so  that 
the  least  number  of  blocks  (in  the  "tail"  for  the  one-sided  U-statistic) 
contain  the  greatest  number  of  observations  on  <9^.  By  transforming  and 
rotating  the  functions  according  to  the  above  method,  the  chances  of 
accomplishing  this  objective  appears  to  be  rather  good  for  emphasizing 
the  alternative  hypotheses. 

Since  this  one-sided  test  can,  in  this  case,  be  used  to  emphasize 
general  two-sided  alternatives  that  are  associated  with  the  U-test,  then 
it  would  appear  to  have  greater  power  than  the  two-sided  test.  The  same 
construction  techniques  suggested  for  the  multivariate  one-sided  U-test 
applies  directly  to  the  univariate  case  as  well.  Thus,  all  univariate 
two-sided  alternatives  can  be  treated  by  a  univariate  one-sided  U-test. 

This  example  was  selected  to  emphasize  the  fact  that  not  all  ap¬ 
propriate  univariate  two-sample  tests  can  readily  be  applied  to  the 
multivariate  situation  by  disregarding  the  interpretation  of  the  test 
statistic  and  any  other  consideration  to  be  imposed  on  the  content  and 
use  of  the  blocks.  Some  tests  may  be  applied  directly  without  any  re¬ 
striction  on  the  block  usage.  This  is  shown  by  the  next  example. 

Suppose  that  the  Dixon  C2  test  [  5  ]  is  considered  for  application. 
In  the  univariate  case  this  test  is  consistent  and  moderately  efficient 
for  virtually  all  alternatives  of  interest.  The  test  statistic  is  ex¬ 
pressed  directly  in  terms  of  the  basic  block  frequency  counts: 


C2 


n+1 

r. 

i=J 


A  thorough  description  of  the  properties  of  the  C2  test  is  provided  in 
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reference  ( 14]  on  pages  I  !>J  and  l‘>4.  The  null  distribution  of  C2  is 
tabulated  for  some  n  and  m  in  reference  f  r>  1  .  If  a  <  1/2  denotes  the 
selected  significance  level  of  the  test  and  if  nm/(n+m)  >_  6  and  (n+m)/ 
(Anm)  _<  a  the  null  distribution  of  C'  can  be  approximated  by  the  chi- 
square  distribution.  The  Dixon  C2  test  is  always  one-sided. 

Since  the  C2-statistic  does  not  depend  on  any  preordering  of  the 
basic  blocks,  then  the  interpretation  of  the  C2  statistic  for  the  multi¬ 
variate  case  remains  unchanged  from  the  univariate  case  as  long  as  a 
logical  procedure  for  forming  the  blocks  has  been  established.  Thus, 
large  values  of  C‘  will  indicate  in  either  case  that  the  null  hypothesis 
is  probably  not  true.  If  the  exact  form  of  the  alternative  hypothesis 
cannot  be  specified,  it  would  appear  that  the  Dixon  C2  test  would  be  a 
most  appropriate  choice. 

Next,  a  few  particular  data  situations  are  considered  for  applying 
the  C2  test  (or  most  any  other  appropriate  test  whose  statistic  does  not 
depend  on  an  ordering  of  the  blocks  formed). 

As  stated  earlier,  the  operational  objective  of  the  proposed  method 
is  to  construct  the  desired  tolerance  regions  (blocks)  by  trying  to  make 
the  test  statistic  as  significant  possible  subject  to  the  rules  defined 
by  the  B*  process  and  any  other  additional  considerations.  The  decision 
of  which  blocks  to  divide  or  not  to  divide,  the  choice  of  candidate  sets, 
and  the  selection  of  a  cutting  function  at  each  stage  should  naturally 
depend  on  the  test  statistic,  the  set  sizes  n  and  m,  and  on  the  signifi¬ 
cance  level  of  the  test.  For  example,  suppose  n  -  m  =  4  and  the  test 

significance  level  was  chosen  to  be  0.10,  then  the  Dixon  C‘  test  would  he 


-  'l<> 

to  reject  the  nul  L  hypot hesis  if  l'/  >_().&,  otherwise  it  is  not  rejected. 

This  critical  region  is  onlv  obtained  whenever  one  of  the  five  basic 

blocks  contain  all  four  of  the  observations  on  0  .  If  the  plotted  un¬ 
til 

identified  (transformed)  observation  vectors  yielded  the  data  situation 

given  in  Figure  4a,  then  the  best  intuitive  procedure  is  to  select  four 

points  lying  in  what  appears  to  he  a  cluster  as  the  candidates  for  the 

observations  on  0  .  the  other  points  are  then  candidates  for  the  obser- 
n 

vations  on  0  .  In  the  first  stage  take  ii  =  4  and  a  real-valued  function 
which  best  describes  (for  convenience  circles  are  used  in  Figure  4)  a 
boundary  about  the  candidate  set  for  0  .  If  after  establishing  the  first 
stage  cut  the  two  blocks  take  the  form  given  in  Figure  4b,  then  all  ob¬ 
servations  on  n  are  dearie  identified  since  they  lie  within  one  basic 
block  -  then  the  mill  hypothesis  is  rejected  at  the  first  stage.  If  the 
first  stage  blocks  take  the  form  shown  in  Figure  4c,  then  only  x*  con  be 
identified  and  a  second  stage  is  required.  A  new  set  of  candidate  points 
for  the  observations  on  c  are  selected,  perhaps  those  nearest  the  identi¬ 
fied  observation  x".  A  second  function  emcompassing  these  points  is  used 
to  determine  the  second  stage  cut  (for  i?  =  3).  Then  if  the  result  given 

by  Figure  4d  is  obtained  the  observations  on  0  lie  in  a  basic  block 

m 

rejecting  the  null  hypothesis.  If  the  result  described  by  Figure  4c 
occurred  then  the  null  hypothesis  cannot  he  rejected.  Also,  if  at  the 
first  stage  the  resulting  blocks  took  the  general  form  given  by  Figure 
4f  the  null  hypothesis  could  not  be  rejected.  A  similar  approach  could 
be  used  whenever  the  data  yielded  two  reasonably  well  defined  clusters 


of  sizes  u  and  n. 
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Now  suppose  the  data  situation  yields  one  cluster  of  points.  In 
this  case,  it  may  be  rather  difficult  to  select  tile  candidate  sets.  The 
suggested  technique  for  handling  this  situation  is  to  first  determine 
the  mean  vector  for  the  totality  of  unidentified  observed  vectors.  Take 
i)  =  1  and  define  a  first  stage  cutting  function  whose  center  is  at  the 
computed  mean  vector,  as  in  Figure  5a.  Considering  all  the  permissible 
information  available  at  the  end  of  the  first  stage,  in  particular  the 
relative  positions  of  the  vector  x*  and  the  remaining  unidentified  ob¬ 
served  vectors,  it  may  now  be  possible  to  select  the  apparently  "most- 
likely"  candidate  sets.  Then  continue  the  P*  process  selecting  cutting 
functions  which  tend  to  radiate  out  from  the  center  (see  Figure  5b). 
However,  if  candidate  sets  cannot  be  reasonably  defined,  set  ±2  =  1  and 
select  the  second  stage  cutting  function  to  have  its  center  at  x*,  etc. 

Of  course,  there  r re  many  other  data  situations  that  could  occur 
(e.g.  three  or  mor^  dist'nct  data  clusters).  Similar  procedures  for 
handling  these  situations  could  he  established  as  long  as  the  basic 
operational  objective  remains  unchanged. 

The  final  example  considers  the  use  of  Mathisen's  quartile  test 
[11].  This  test  is  based  on  the  frequency  counts  determined  by  tolerance 


regions  representing  25  percent  regions  defined  by  the  observations  on 

0  .  The  test  requires  that  (n+l)/4  and  m/4  are  both  integers.  The  test 
n 

statistic  i?  given  by 


H  =  16  X  [m.  -  y]  7  / (9m7 ) 
i=l  1  4 


where  tn  ,  i  • 
cent  regions. 
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Figure  5a. 


■  ] 


Figure  5b. 


w 


re  if  roue e  []/»).  A  tabic  ...  i...  nul  !  distribution  is  given  in  reference 

[11).  For  certain  n  and  m  the*  nul  ]  distribution  can  be  approximated  by 
the  beta  distribution.  In  the  univariate  ease  this  test  emphasizes 
simultaneously  differences  in  location,  dispersion  and  skewness.  The 
test  is  one-sided  and  is  rat'  er  similar  to  the  Dixon  C2  test  for  n  =  3. 

Since  the  B-statistic  does  not  depend  on  any  ordering  of  the  blocks 
(25  percent  regions)  no  special  considerations  of  block  orderings  are 
necessary.  Two  approaches  for  forming  the  desired  regions  are  given, 
iiie  first  approach  us.  -s  only  three  stages  of  the  !<*  process  in  the  most 
effective  manner,  while  the  other  approach  requires  all  n  stages. 

The  first  suggested  procedure  is  to  select  iiet^r-  ,  }  and 

choose  some  appropriate  cutting  function  depending  on  the  data  situation. 
This  first  stage  cut  divides  the  sample  space  into  a  block  of  content 


n+ 1  .  ,  .  ,  ,  3 ( n+ 1 ) 

and  anoth.-r  block  of  content  — - - — 

u  4 


Since  the  desired  tolerance 


regions  are  25  percent  regions  the  M  ock  "of  content cannot  be  divided 
at  any  later  stage.  By  the  rules  defined  by  the  P*  process,  all  observa¬ 
tions  in  this  block  can  he  identified.  This  information  (if  not  sufficient 
to  conclude  the  test)  should  prove  extremely  valuable  in  selecting  the 
second  stage  candidate?  sets  and  cutting  function.  In  the  second,  the  re¬ 
maining,  block  is  bisected  by  choosing  -  f .  Again,  one  block 

of  content  is  formed  (the  other  block  is  of  content  -)  .  All  obscr- 

‘t  L 

vm  ions  in  this  block  .an  be  identified.  All  this  information  can  now  be 

n+]  * 

used  to  select  the  third  stage  cutting  function  (for  (3  =  — j")  to  bisect 


the  remaining  block  of  content 


n+1 


The  second  suggested  procedure  is  to  predefine  some  scheme  or  con- 


struction  plan  for  forming  Liu:  desired  tolerance  regions,  from  basic 

blocks.  This  scheme  must  he  defined  before  "looking-at"  the  observed 

data.  One  .simple  approach  is  to  use  the  B*  process  by  selecting  cutting 

functions  to  form  basic  blocks  whose  centers  are  L'he  apparent  centers  of 

the  candidate  sets  for  the  observations  on  0  .  These  cutting  functions 

n 

will  tent!  to  radiate  out  from  the  actual  center  of  the  observations  on 
0  ,  in  a  fasti' o..  shown  by  Figure  5b.  The  ,  and  stage 

cutting  functions  will  then  define  the  desired  25  percent  tolerance 
regions . 

The  null  dist  r ’but ions  referenced  for  the  above  statistics  were 
derived  oncer  'lie  unronefi  t.  ional  probability  model.  Ry  the  corollary 
_L;ive::  in  Chanter  VII.  these  distributions  also  hold  for  the  conditional 

pa  rev 1  L  a!.  ’  or  model. 

A  forma J  out  I  ire  of  the  suggested  operational  procedure  can  now  be 


fl)  Consider  all  independent  information  that  is  available  prior 
r  •  *.!•••  observations.  This  information  may  include  previous  obser¬ 

vations,  t!ie  nodal  characteristics  of  the  underlying  distributions,  etc. 

(2)  Select  or  construct  an  appropriate  two-sample  tolerance  test 

•>-„  •  -or  f  •  i  dor,  ‘  significance  level. 

(3)  Carefully  consider  the  test  statistic  expressed  in  terms  of 
:..-e  appropriate  block  f requenev  counts.  From  this,  determine  the  number 

;  boric  block  contents  of  the  tolerance  regions  to  bo  formed  and  any 
'  -‘-’i  t'onol  consideration':  (e.g.  const,  ruet  i  on  plan,  block  order,  ctc.l 
•-hat  nav  be  imposed  on  Hie  block  usage. 


eliminated  the  B *  process  is  violated  and  the  subtests  are  no  longer 
independent.  Some  suggestions  for  alleviating  this  bias  are  presented. 

One  approach  is  to  list  (if  possible)  a  set  of  fixed  rules  which 
generally  apply  for  all  data  situations  that  could  occur  at  any  stage  in 
the  B*  process.  These  rules  are  used  to  make  decisions  for  selecting 
the  candidate  sets  and  possibly  the  cutting  functions  at  each  stage. 

One  elementary  example  is  provided. 

Suppose  all  new  data  sets,  except  the  first  data  set,  consists  of 
only  one  observation.  Then  the  rule  to  always  select,  as  the  candidate 
for  the  new  observation,  the  one  unidentified  observation  which  lies  the 
"furthest"  away  from  the  set  of  remaining  unidentified  observed  vectors. 
The  concept  of  distance  in  this  rule  may  be  defined  by  fitting,  say,  an 
ellipsoid  function  to  the  set  of  all  data  points.  Then  the  observed 
vector  yielding  the  largest  value  of  this  function  will  be  the  candidate 
■for  tticf new' 'observ ation".  This  rule  can  be  repeated  at  each  stage  of  the 
B*  process  us .ng  only  the  unidentified  observed  vectors. 

The  approach  of  alternating  or  employing  different  analysis  for 
each  subtest  shouid  reduce  or  eliminate  all  bias.  This  approach  appears 
to  be  direct  and  simple  to  apply. 

In  the  next  chapter  several  areas  of  application  are  discussed. 
Most  of  these  areas  are  limited  to  medical  applications. 


CHAPTER  VI 


SOME  AREAS  OF  APPLICATION 

The  sequential  significance  tests  having  multivariate  two-sample 
tolerance  subtests  presented  in  this  paper  apnear  to  he  generallv  ap¬ 
plicable  to  most  quality  control  as  well  as  to  other  testing  situations. 
A  few  specific  medical  applications  are  cited.  Other  areas  apparently 
submissive  to  these  tests  arc  listed  at  the  end  of  this  chapter. 

The  first  application  to  be  considered  is  the  quality  control  of 
a  system  used  to  determine  the  electrophoretic  analysis  of  serum  pro¬ 
tein.  This  method  of  characterising  serum  protein.,  has  provided  better 
understanding  of  associated  clinical  disorders  and  in  some  instances 
has  aided  in  recognizing  new  diseases  complicated  by  serum  protein  ab- 
normaiirto's ; . - . - . . . . - . — . - - -  - . . . . 

Elcctrophoreti call y  separated  serum  proteins  are  classified  in 
five  rather  distinct  groups:  albumin,  o.\~,  a.?.-,  3-,  and  y-  globulins. 
The  basic  results  of  an  electrophoretic  serum  analysis  are  given  by  the 
concentrations  of  these  protein  groups.  These  measurements  are  usually 
expressed  in  terms  of  the  fractions  of  total  protein  concentration.  The 
systems  used  to  obtain  those  measurements  is  influenced  by  several,  fac¬ 
tors:  human,  mechanical,  chemical,  and  electrical. 

Present  quality  control  techniques  used  to  test  the  system  opera¬ 
tion  consider  each  of  five  univariate  measurement's  independently  of  the 
other  four.  This  method  of  testing  would  he  highly  Questionable  if 
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there  exist  any  dependent  relationships  among  the  five  variables.  This 
v/ould  not  be  a  problem  if  multivariate  tests  were  used. 

The  standard  method  of  testing  the  quality  of  this  system  is  a 
sequential  quality  control  test  which  is  very  similar  in  structure  to 
that  presented  in  Chapter  II.  That  is,  the  previous  data  is  continually 
reused;  however,  there  is  no  regard  for  independence  between  subtests. 

A  reserve  bank  supplies  the  source  of  serum  used  to  conduct  the  quality 
control  tests.  The  serum  in  this  bank  is  replenished  periodically  by 
sampling  from  the  excess  of  serum  tested  over  previous  days.  The  serum 
samples  are  combined,  homogeneously  mixed,  and  frozen  for  preservation. 
For  each  subtest  one  or  two  samples  are  taken  from  the  serum  reserve 
bank  and  electrophorctically  processed  in  the  system.  The  results  are 
analyzed,  then  tested  against  previous  results  to  determine  if  the 
system  is  in  or  out  of  control. 

Since  the  multivariate  observations  consist  of  five  continuous 
variables,  the  proposed  method  of  forming  two-sample  tolerance  tests 
trivially  holds.  The  test  situation  then  appears  to  conform  well  to  the 
quality  control  test  presented  in  the  first  example  of  Chapter  V. 

The  next  application  is  to  clinical  trials.  The  objective  of 
clinical  trials  is  to  compare  the  effect  of  some  treatment  (e.g.  a  drug) 
to  some  standard.  This  standard  may  be  described  by  measurements  on 
untreated  patients  or  on  patients  subjected  to  a  different  treatment. 

The  m'.uC'ireraents  used  for  comparison  are  in  the  form  of  symptoms,  signs, 
and/or  clinical  findings.  One  approach  to  clinical  trials  is  to  enter 
one  patient  at  a  tim--  into  1  ,ie  experiment.  A  set  of  measurements  on  the 


treated  patient  is  used  to  compare  against  the  standard  measurements. 

The  sequential  testing  is  continued  until  significance  occurs  or  a 
maximum  number  of  tests  have  been  conducted. 

All  measurements  of  symotoms  and  signs  are  usually  considered 
discrete;  however,  most  clinical  findings  (e.g.  temperature,  weight, 
blood  chemistry,  serum  protein  analysis,  etc.)  are  continuous  variables. 

If  at  least  one  of  these  continuous  variables  are  included  in  the  measure¬ 
ment  of  treatment  response,  then  the  nroposed  method  oc  forming  two- 
sample  tolerance  tests  can  be  used  to  determine  subtests.  Again,  the 
quality  control  test  given  in  the  first  example  of  Chanter  v  ran  be  ao- 
plied  to  this  problem. 

A  rather  common  problem  in  medical  research  (similar  to  problems 
in  other  scientific  and  engineering  disciplines)  is  cited  next.  Suppose 
three  groups  of  patients  arc  involved  in  an  experiment  where  each  group 
is  subjected  to  a  different  treatment.  From  limited  independent  previous" 
knowledge,  there  is  reason  to  believe  that  two  of  these  treatments,  say 
A  and  B,  do  not  differ  in  their  measured  responses,  while  the  third 
treatment,  C,  response  affect  is  either  unknown  or  is  believed  to  differ 
from  the  other  two.  The.  desired  test  procedure  is  to  first  test  the 
null  hypothesis  that  treatments  A  and  B  yield  the  same  effects,  then  if 
this  hypothesis  cannot  be  rejected  test  if  treatment  C  differs  from  the 
combined  affects  of  treatments  A  and  B.  If  the  observations,  used  to 
measure  the  treatment  effects,  are  at  least  partially  continuous,  then 
the  sequential  significance  tests  having  the  proposed  multivariate  two- 
sample  tolerance  tests  as  its  subtests  can  be  made  to  apply  to  this 


procedure . 
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An  extension  to  this  problem  follows.  Suppose  there  are  k  (k  3) 
groups  of  pat  ants.  Each  group  is  subjected  to  a  different  treatment. 
Then  prior  to  taking  the  observations,  the  treatments  are  ordered  ac¬ 
cording  to  their  believed  differences.  That  is,  those  treatments  con¬ 
sidered  first  in  this  ordering  are  assumed  to  produce  near  similar 
responses,  etc.  Again,  if  the  observations  are  at  least  partially  con¬ 
tinuous,  the  tests  proposed  in  this  paper  can  be  used. 

Some  other  general  areas  of  possible  application  are:  water  and 
waste  treatment  plant  quality  control,  traffic  studies,  scientific  and 
engineering  research,  industrial  quality  control,  market  and  other 
sampling  surveys. 


I 


CHAPTER  VII 

STATEMENT  OF  BASIC  RESULTS 

Two  major  results  are  presented  in  the  form  of  a  theorem  and  corol¬ 
lary.  All  other  basic  results,  some  of  which  are  direct  consequences  of 
the  theorem  or  corollary,  are  verified  in  an  informal  format. 

The  following  theorem  proves  that  the  joint  null  distribution  of 
block  frequency  counts  obtained  at  any  stage  in  the  B*  process  is  the 
same  as  if  it  were  obtained  by  the  standard  one-sample  process.  The 
proof  is  given  in  Appendix  II. 

Theorem 

Let  0  =  { X i , X2 , . . . ,X  }  and  0  =  {Y j ,Y2 , . . . ,Y  }  be  two  sets  of 

n  ’  n  m  m 

random  vectors,  not  necessarily  independent,  defined  on  a  sample  space  X. 

These  sets  are  such  that  there  is  a  probability  one  of  the  construction 

process  B*  being  unique  (no  ties  in  the  cutting  function  values)  which 

occurs  in  particular  whenever  the  random  vectors  are  at  least  partially 

continuous.  Under  the  null  hypothesis,  let  the  combined  set  of  random 

vectors  0  ,  =  (Xi,X2,...,X  ,  Yi,Y2,...,Y  }  have  a  symmetric  joint  cumu- 
n+m  n  m 

lative  distribution  function  denoted  by  F  =  F(xi,x2 . x  ,  y 1 ,y2 , . . . >y_) • 

n  m 

Given  a  set  of  r  +  1  blocks  (B^.B^ ,  • . .  formed  at  the  rth  stage  of 

the  generalized  block  construction  process,/?*,  on  a  set  of  observations 
on  0  ,  then  the  joint  null  distribution  of  mi ,m? , . . . ,mr+1 ,  the  respective 
block  frequency  counts  on  0  ,  is  given  by 
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r+l  .  , 

{  n  (mi  ki 


1-4 


m. 

1 


1)}/( 


m  +  n. 


for  non-negative  integers  m.,  i  =  1,2,..., r+l  such  that  mi+  m2+. . .+m  =m. 

r  r+l 

Here,  k^  denotes  the  "number  of  basic  blocks"  contained  within  the 

block  Br,  with  k.-l  being  the  number  of  observations  on  0  in  Br. 
ii  n  x 


Corollary 

The  above  theorem  holds  under  the  permutation  probability  model 
whenever  the  sets  of  random  vectors  are  such  that  there  is  probability* 
one  of  the  construction  process  b*  being  unique. 

Proof  of  Corollary 

Let  {xi,x2,...,x  }  and  {yi,V2»«-*»y  )  denote  fixed  sets  of  obser- 
n  m 

vations  on  0^  and  0  ,  respectively.  Let  S  be  the  conditional  permutation 

sample  space  given  the  combined  fixed  set  of  observations  {xi,x2,...,x  , 

n 

yi,y2, • • • ,ym).  Then  S  is  equivalent  to  the  set  of  all  (n  +  m)-tuples 

obtained  by  permuting  this  given  combined  set  of  observations.  Let  X^, 

i  =  l,2,...,n  represent  the  random  vector  (in  the  permutation  model) 

yielding  the  vector  value  located  in  the  itk  coordinate  position  and  Y  , 

j  =  l,2,...,m,  denote  the  random  vector  yielding  the  vector  value  located 

in  the  (n  +  j)tk  coordinate  position  in  the  (n  +  m)-tuples  in  S.  Then 

the  joint  null  cumulative  distribution  of  the  random  vectors  (Xi ,X2 , . . . jX^, 

Yi,Y2,...,Y  }  is  symmetric.  Set  S  =  X  and  the  proof  follows, 
m 

This  corollary  implies  that  any  test  statistic  associated  with  a 
two-sample  tolerance  test  formed  by  the  proposed  new  method  has  the  same 
null  distribution  whether  it  was  considered  on  an  unconditional  or  per- 
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mutation  probability  basis.  Thus,  all  null  distributions  that  have  been 
obtained,  under  the  unconditional  model,  for  any  appropriate  two-sample 
tolerance  test  statistic  are  directly  usable  under  the  permutation  model. 

An  immediate  consequence  of  the  theorem  is  considered  next.  Sup¬ 
pose  0  and  0  are  independent  random  samples.  Then,  under  the  null 
n  m 

hypothesis,  their  joint  distribution  is  symmetric. 

In  Chapter  IV  it  was  shown  that  the  standard  one-sample  process  was 
a  special  case  of  the  B*  process.  Then  it  follows,  from  the  above  re¬ 
mark,  that  the  existing  method  is  a  special  case  of  the  proposed  method 
for  establishing  multivariate  two-sample  tolerance  tests.  The  existing 
method  can  then  be  extended  to  consider  two  data  sets,  not  necessarily 
independent,  whose  joint  null  distribution  function  is  symmetric.  Then, 
from  the  above  corollary,  any  two-sample  tolerance  test  formed  by  the 
existing  method  has  a  permutation  probability  basis. 

It  remains  to  show  that  any  statistic  associated  with  a  two-sample 
tolerance  test  formed  by  the  proposed  method  is  symmetric  in  the  obser¬ 
vations  on  which  the  tolerance  regions  were  defined  and  is  symmetric  in 
the  observations  used  to  establish  the  block  frequency  counts.  This  is 
equivalent  to  s'.. owing  that  any  statistic  is  symmetric  with  respect  to 

both  0  and  0  .  It  was  never  required  at  any  stage  in  the  B*  process 
n  m 

to  associate  an  observation  with  the  particular  random  vector  yielding  it. 

The  process  only  permitted  an  observation  to  be  identified  with  the  set 

of  observations  from  which  it  came.  Then  all  identified  observations 

(in  this  sense),  thus  the  set  of  all  observations,  are  symmetric  with 

respect  to  both  0  and  0  .  Hence,  any  statistic  is  symmetric  with  re- 
n  m 

spcct  to  both  0  and  0  . 

n  m 


I 
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APPENDIX  I 


Definition  I 

If  V  =  (V^ , . . .  ,V^  is  a  p-component  random  vector  and 
v  =  (v^.v^,...,^)1  is  a  p  x  1  vector  having  real  components, 
then  the  event  that  v^  simultaneously  for  all  i  =  1,2,..., p 

is  represented  by  V  <  v. 


Definition  II 

Let  Vj.Va,. .. ,V  be  a  set  of  p-component  random  vectors,  then  the 
joint  cumulative  distribution  function  of  Vi,V2,...,V  is  given  by 


KVi  <  vi,  V2  <v2)...  ,Vt  <  vt) 

where  vj,V2,...,v  are  real  component  p  x  1  vectors  and  the  events 
V.  <  v.  i  =  l,2,...,t  are  defined  in  Definition  1. 

l—i 

Lemma  1-1 

Let  W,,W2,...,W  be  a  set  of  p-component  random  vectors  which  are 
defined  on  the  sample  space  .V  and  have  a  joint  cumulative  distribution 
function 


F(wi  ,W2  , .  .  .  ,w  )  =  l’(Wi  <  W|,  W2  <  w?  , .  . .  ,W  <  w  ). 

q  q  q 

If  F(wi ,w2 , . . . »Wq)  is  »  symmetric  function  on  all  nets  (wi ,w2 , . . . ,w^} 
of  real  component  p  x  1  vectors,  then  for  any  set  of  measurable  functions 


(real  or  vector  valued)  g^(Wi ,W2, . . . ,W  )  i  =  1,2,. . . ,k  and  any  reorder¬ 
ing  (i  i  ,i2  , . • . , i  )  of  the  integers  (1,2,. ..,q) 
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P[gi(Wi,W2,...,W  )  £  a, . gk(W,,«2l...,W  )  £  a^] 

=  P[gi(W,  ,W  ,...,W  )  <  aif...,g  (W  ,W  ,...,W.  )  <  a,  ) 

lll2  1—  klil2  1  —  K. 

q  q 

where  a.  is  a  real  component  vector  of  the  same  dimensionality  as 


gi(Wx,W2,...,W  )  for  i  =  l,2,...,k. 


Proof 

Lc t  cr i  (1,2 , .  .  t  j^j)  ”*■  (i  i » i 2  >  •  •  •  > ^)  > 

A  =  { (wi ,w2 , . . . ,w^) [fei (w] ,w2, . . . ,w^)  £  ai,..., 

gR(wi ,w2 , . . • ,W  )  £  aR;  w^cRp  i  =  1,2, ... ,q}, 

A  =  {(w  ,w.  , ...,w  ) |gi(wi,w2,...,w  )  £  ai . 

U  J  1  12  x  Q 

q 

gt.  ,V2  , ,w  )  £  a^  wir.Rp  i  =  1,2,..., q}, 

and 

B  =  {  (w ;  ,w?  , . .  .  ,w  )  |  g  j  (w  ,w  , .  .  .  ,w  )  £  a  i ,  .  . .  , 

0  q  lil?  1  — 

q 

g,  (w ;  ,W  )  <  a  ;  w  fRp  i  =  1,2 . q} 

q 

where  RP  is  the  p-dimens ional  euclidian  space. 


it  suffices  to  show  that  P(A)  =  P(B  ). 


P  (A)  = 


d!'(w;  ,w,  ....  ,w  ) 

q 


dF(w  ,w  ;...,w  ) 

i,  i?  iq 


since  K(w i ,w? , . . . ,w  )  is  symmetric. 

Consider  the  transformation  C  -i,.  . 

o  ( i  . ) 

.1 

then  the  transformation  V.  =  l .  i  =  1,2,. 

i  i 

This  gives 


=  W.  for  j  =  1,2, . . .  ,q, 

j 

. , q  in  the  last  expression. 


dF(w . 

1 1 


1  2 


’Wi 


dF(w i ,w? , 


•V  "  P(V 


proving  the  lemma. 
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Lemma  T.  -  2 

Let  Wi,W2,...,W  he  a  set  of  p-component  random  vectors  defined  on 

q 

a  sample  space  !■/  and  have  a  symmetric  joint:  cumulative  distribution 

function,  F(wi ,w2 , .  . .  ,w  )  .  Let  (Zj  ,Z2 . Z^}  be  a  sttbset  of  the  random 

vectors  Wi,W?,....W  and  F  represent  the  totalitv  of  information  on  the 

q 

observations  on  Wj,  W2,...,W  wh i ch  is  symmetric  with  respect  to 
{2 i ,Z2 , . . . ,Z  )  .  If  g(W,F)  is  a  measurable  function  on  F  and  W  =  Wj 
for  any  i  =  l,2,...,q,  then  the  joint  cumulative  distribution  function 
of  g(ZuF),  g(Z2  ,F)  , . .  .  ,g(.Zt  ,F)  is  symmetric. 


Proof 


By  definition,  F  is  invariant  under  any  relabeling  of  the  identities 


in  (Zi,Z2  .  ,Zt)  . 

If  the  roles  of  the  random  vectors  7.^  and  7,  are  interchanged,  the 
statistic  g(Zi,ij  becomes  g(Z  ,/•’)  and  vice  versa.  Then  by  an  interchange 
of  the  roles  of  the  random  vectors  in  {Zj ,Z2 , . . . ,Z  } ,  the  set  of  statis¬ 
tics  {g(Z  i ,  F) ,  g(7.2,F),...,g(Z  ,F)}  is  mapped  onto  itself. 

Let  (W!  ,V.7- , .  .  .  ,V.M  -  g  (Z  j ,  F)  ,  i  =  t  in  Lemma  1-1  for 

k  =  t  and  the  proof  follows. 


information  o"  a  set  of  observations  is  defined  to  be  symmetric 
with  respect  to  a  set  of  random  vectors  if,  and  only  if,  the  information 
is  unchanged  by  int  •  ret-  mg i up,  the  roles  of  the  random  vectors.  (See 
Chapter  IV  and  Appe  uix  II). 


APPENDIX  II 

Theorem 

Let  0  =  {Xj,X2,...,X  }  and  O  =  {Yj,Y2,...,Y  }  be  two  sets  of 

n  n  m  m 

random  vectors,  not  necessarilv  independent,  defined  on  a  sample  space  X. 
These  sets  are  such  that  there  is  a  probability  one  of  the  construction 
process  B*  being  unique  (no  ties  in  the  cutting  function  values)  which 
occurs  in  particular  whenever  the  random  vectors  are  at  least  partially 
continuous.  Under  the  null  hypothesis,  let  the  combined  set  of  random 
vectors  0n+m  =  (X i , X2 , • • • .X^,  Y 1 ,Y2 , . . . , Y^}  have  a  symmetric  joint  cumu¬ 
lative  distribution  function  denoted  by  F  =  F(xi ,X2 , . . . ,xn>  . . .  ) . 

Given  a  set  of  r  +  1  blocks  (B^jB* , .  . .  jB^^}  formed  at  the  r*"^1  stage  of 
the  generalized  block  construction  process,  B *,  on  a  set  of  observations 

on  then  the  joint  null  distribution  of  . . the  respective 

block  frequency  counts  on  0  ,  is  given  by 


for  non-negative  integers  nn,  1  =  l,2,...,r+l  sucli  that  m]  +  m2  +... 

+  m  ,,  =  m  and  where  k.  denotes  the"number  of  basic  blocks "contained 
r+1  1 

X" 

within  the  block  B^,  i  =  1,2,.,., r+1. 

Proof 

The  formal  method  of  proof  uses  induction  on  the  number  of  stages 
in  the  generalized  block  construction  process,  B* ,  presented  in  Chapter  IV. 
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The  joint  null  distribution  of  the  block  frequency  counts  on  0^  is 
determined  at  each  stage.  This  is  arrived  at  by  deriving  the  conditional 
null  distribution  for  the  block  frequency  count  on  0 for  one  of  the  two 
new  blocks  formed  at  the  stage  being  considered  given  the  joint  frequency 
counts  observed  at  the  previous  stage.  This  proof  makes  repetitive  ap¬ 
plications  of  Lemmas  1-1  and  1-2  presented  in  Appendix  I.  For  conven¬ 
ience,  a  few  terms  and  symbols  defined  in  Chapter  TV  are  restated. 

Definition  1 

An  observation  is  said  to  be  identified  if  it  can  be  associated 

with  the  set  of  random  vectors  which  yielded  it;  that  is,  associated 

with  0  or  0  .  Thus,  a  set  of  observations  is  identified  if  each  obser- 
n  m 

vation  within  the  set  is  identified. 

Definition  2 

Information  on  a  set  of  observations  is  said  to  be  symmetric  with 
respect  to  a  block  (or  union  of  blocks)  if  the  information  is  unaffected 
by  interchanging  the  roles  (relabeling  the  identities)  of  the  random 
vectors  yielding  the  observations  falling  within  the  block(s) .  For 
example,  if  Vj,V2,...,V^  denotes  the  set  of  random  vectors  yielding  ob¬ 
servations  falling  within  the  block  15,  then  the  information,  7,  defined 
on  some  set  of  observations  which  may  or  may  not  contain  those  in  B,  is 
symmetric  on  B  if  for  any  reordering  (V J , . . . ,V^)  of  (Vj , V2 , . . . , V^) 

1  is  unchanged. 

Definition  3 

The  symbols  o  ,  o  ,  and  o  .  will  denote  the  observations  on  the 
n  m  n+m 

k  . 

random  vectors  in  0,0,  and  0  .  ,  respectively.  Likewise  0.  will  denote 
n  m  n-rm  J 
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the  set  of  random  vectors  yielding  the  set  of  observations  o^  which  fall 
within  block  B..  at  the  k  stage  of  the  process  B* . 

Definition  4 

The  symbols  Zi,Z2,...,Z  ,  will  denote  the  random  vectors  in  the 

n+m 

combined  set  0n+m.  Also,  the  symbols  Z^,zj . Zk(j)  wil1  rePresent 

the  random  vectors  in  0^  and  ,X"| , . . .  ,X"1  ^  ^  and  ,Y^  , . . .  )  will  t>e 

k  k  r*\ 

those  random  vectors  in  0 .  C\0  and  0.110,  respectively.  All  lower 

.1  n  j  m 

case  symbols  x,  y,  and  z  will  denote  corresponding  observed  vectors. 

At  the  first  sta^a  in  3*  all  symmetric  information  with  respect  to 

X  is  available.  This  includes  the  set  of  unidentified  observations  in 

o  .  Then  I\  consists  of  the  total  information  available  on  o  which 
n+m  n™ 

is  symmetric  on  0^^. 

Let  ijc(l,2, . . . ,n}  =  J  be  selected  based  on  I\.  Then  using  Ji, 

n 

determine  a  real-valued  measurable  function  ii>j(Z,Ti)  which  either  has  a 

contiruous  distribution  function  whenever  Z  E  0  _  and  the  random  vectors 

n+m 

in  0  have  the  joint  null  distribution  function,  F,  or  was  selected  in 
n+m 

such  a  way  that  there  are  no  ties  ~n  the  set  of  values  {(|>i  (z, I \)  |  z£on+n)} . 

Let  x*  e  o  be  such  that  <h(>?,Ji)  =  ci  is  the  i.  largest  value 
1  n  1 

in  the  set  of  real  numbers  { <t> i  (x,Ii )  | x  C  o^}.  [Note:  at  this  point  only 

x*  and  ci  are  available  information  not  the  entire  set  of  values 

{<}:i(x,Ii)  (x  c  o  }].  Now,  using  the  cutting  function  $i(x,-Ti)  =  ci  divide 

the  sample  space  X  into  the  first  stage  blocks  denoted  by  the  open  regions 

Bj  =  {x  c  x|Mx,/i)  <  ci)  and  bJ  -  {x  c  *|<f>i(x,7i)  >  ci).  There  are 

l  l 

exactly  i j  —  1  and  n  -  ii  observations  in  falling  within  Bj  and  B2 , 


respectively. 


To  determine  the  nail  probability  that  ti  ob.sorvat i oils  in  o  will 

m 

fall  within  block  Bj  (m-tj  in  B^)  is  equivalent  to  determining  the  null 

probability  that  exactly  ti  of  the  observed  variables  (hCy^Ti) 

i  =  1,2, ... ,m  have  values  less  than  C).  To  evaluate  this  probability 

it  is  necessary  to  establish  the  joint  null  distribution  of  the  random 

variables  <J>i  (Zi  ,Ji)  ,  <f>i  (Z2  ,Ii)  ,  • . .  » 4>  1  (Z  .  ,1 1). 

n+m 

Appeal  to  Lemma  1-2  (replacing  the  W^'s  with  Z  's,  set  t  =  q  =  n+m, 
and  =  g(Z,F)).  Then  it  follows  that  the  joint  cumulative  dis¬ 
tribution  of  <f> i  (Z  i , -T i  )  ,  <)>i  (Z2  , 1 1)  ,  .  . .  ,<J>i  (Z  ,-T i)  is  symmetric.  But  this 

n+m 

implies 

P[*i(Zi,Ji)  <  0i(z 2,h)  <  ...  <  4>1  cz  .  ,Ji)] 

n+m 

=  P[<h(z;,Ii)  <  «i(Zj,Ii)  <  ...  <  (Z^-Hn’^,  >  1 

for  all  reorderings  (Z  \  ,z'2  , .  . .  .Z^)  of  (Z  i  ,Z2  , . . .  .Z^)  . 

By  the  choice  of  (fn(Z,7i)  all  such  4>i -orderings  are  unique  with 
probability  one,  then  the  null  probability  of  each  <J>i-ordering  is 
1/ (m+n) ! . 

Define 

Si  =  U<h(z!,ii)  <  <J)i(Z2,i,)  <  ...  <  <j)1(z^+ni,i1)]|(zi,Z2,...,z^+m) 

is  a  reordering  of  (Zi,Z2,...,Z  „  )}. 

n+m 

Then  the  null  probability  of  each  element  in  S]  is  l/(m+n)!. 

The  null  probability  of  observing  ti  observations  in  o  within  B, 

m  1 

is  then  equivalent  to  the  null  probability  of  obtaining  an  element  in 

s  t 

Si  which  assigns  the  (ti  +  ii)  <f>i-order  position  to  0^  and  exactly  ti 

of  the  first  ti  +  ii  -  1  ({jj-order  positions  to  0  .  This  reduces  to 

m 

counting  the  number  of  elements  in  Sj  which  satisfy  the  desired  event, 
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since  all  elements  in  Si  are  equally  likely.  There  are  (™  )  ways  of 

choosing  ti  random  vectors  in  0  and  (.  n  )  wavs  of  selecting  i i  —  1 

m  1 1 —1 , 1 

s  f 

and  1  (for  the  (ti  +  ii)  "  position)  random  vectors  n  0  to  lie  within 

n 

the  first  ti  f  ii  <j>i~order  positions.  Within  each  chosen  set  of  ti 

random  vectors  in  0  there  are  tiJ  ways  of  assigning  them  to  a  fixed  set 
m 

of  ti  <J>i~order  positions;  likewise,  there  are  (ii-1).'  and  1.'  of  assign¬ 
ing  the  random  vectors  chosen  from  0  .  The  remaining  4'1-order  positions 

ti  +  ii  +  l,...,n+m  must  contain  m-ti  and  n-ii  assignments  to  0  and  0  , 

m  n 

respectively.  The  total  number  of  ways  which  these  positions  may  be 
assigned  is  (m-t i  +  n-ii)I.  Hence  the  total  number  of  elements  in  Si 
satisfying  the.  desired  event  is 

C>(,  ”  ,)  1 1 : ( i 1 - 1 ) :  l!(m-t,  +n-i,)! 

tl  lj-l,l 


=  ( 


tl  +  u-lwm-ti  +  n-i  i ,  ,  i 

)(  )m.n.  . 

t i  m- t i 


Then  the  null  probability  of  observing  exactly  t]  observations  in  o 


within  Bj  is 


P(ti)  = 


ti 


L)( 


m  -  1 1 


)  m .'  n .' 


(m  +  n)  .' 


_  ^ti  +  ii-l^m  -  ti  +  n  -  +  n) 

1 1  m  -  1 1  m 

for  t i  =  0,1, .. . ,m. 

The  joint  null  distribution  function  for  ti,t2,  tlic  frequency 
counts  in  blocks  Bj  and  B2 ,  respectively,  is  obtained  by  transforming 
m  -  t i  =  t? 

p<t.,n>  -  <f‘  *  ‘‘-'x12  t  "  ' 

1 1  t  j  m 


for  non-negative  t\.  i  =  1,2  such  that  tj  +  t2  =  m. 
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In  order  to  clarify  the  method  of  proof  the  joint  null  frequency 
count  distribution  will  be  derived  for  the  second  stage. 

The  complete  information  available  at  the  hep, inning  of  the  second 

stage  consists  of  J i.  r:i ,  x'J>  pod  '■'m  hrn  re*r-  or  meVn^ifind  observa- 

-i  ,  -i 

tions  Oj  and  o2 . 

At  the  start  of  stage  2,  one  of  the  two  first  stage  blocks  c) 
and  B* ,  is  selected  for  division.  This  selection  is  based  on  the  infor¬ 
mation  available  at  the  end  of  the  first  stage.  Suppose  b|  were  selected 
(the  proof  is  analogous  for  Bp)  and  t)  observations  in  fell  in  B ' 

If  Bp  were  to  be  decomposed  at  some  later  stage  in  the  process  Di: . 
then  I2  contains  all  information  symmetric  separately  with  respect 
to  B*  and  B*.  S'c'cv.  r,  if  B*  were  never  to  be  decomposed  at  seme  later 
stage,  then  1 2  contains  at  least  all  information  syr.-’otri c.  with  respect 
to  B*.  This  allows  ,T?  to  contain  any  information  which  is  symmetric 
with  respect  to  b|  but  not  on  B '  .  For  example,  complete  iuenti f ierhion 

of  o2.  This  is  trivially  true  when  Bp  is  a  t.nsic  block  -  all  observa¬ 

tions  within  B,1  have  to  be  in  o  .  Clearly  J?  contains  all  information 
that  was  available  at  the  end  of  stage  1. 

Next,  select  an  integer  i?  C  J  .  based  on  1 2  and  choose  a  rec.l- 

i  1-1 

valued  measurable  function  <)>2  (Z,/-)  which  either  has  a  continuous  null 
distribution  for  Z  e  f>{  or  is  such  that  there  are  no  ties  in  the  set 
of  values  {<*>2  (z,/2)  /z  c  oj). 

Let  x*  r.  o  H  o!  such  that  $2 (x?, F?)  =  C2  is  the  12nd  largest  value 
*  n  1 

in  the  set  (fo (x, T?)  jx  f  o  n  oj).  Again,  only  x*  and  c7  are  determined. 
Using  tlie  cutting  function  <t'2(x,J  2)  =  c- 1 ,  divide  the  block  IlJ  into  two 
disjoint  subsets. 
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Bj  =  {x  £  b}|(()2(x,/2)  <  c2} 

and 

Bj  =  (x  e  B  j  j  <})2  (x,  J2 )  >  c2} 


and  for  standardizing  notation,  let  B*  =  B*.  Then  there  are  exactly 
i2-  1,  ij-i2-l,  and  n  -  i2  observations  in  falling  within  blocks  B * , 

Bj  and  B*,  respectively. 

Now,  consider  the  conditional  event,  Ai,  that  ti  observations  in 

o  fall  within  B?  given  c i  and  x?.  This  event  is  the  intersection  of  the 
m  1  1 

following  two  events. 

A|  =  (<J>i  (Z  i ,  Ji )  <  ci,  4>i(z2.Ti)  <  cj,...,<J>i(Zti+ii_1,Ti)  <  ci) 

and 

A*  =  {*1(7.®  ,10  >  ci  *i(Z*,J,)  >  ci,...,*i(Z^in_ti_ij,J1)  >  ci} 


where  (Z  |  ,Zj  , .  . .  ,Z*  ,+i  ^  and  {Z \  ,Z®  , .  . .  .Z*^  ^  i )  arc  the  random 
vectors  in  0 J  and  0*  >  respectively.  Then  Ai  =  Ajfl  A.,. 

For  any  relabeling  of  the  identities  within  the  set  0 j! 


(1)  by  definition  Ii  remains  unchanged, 

(2)  event  a|  is  unaffected  since  the  set  of  all  <}>i  functions 
in  A|  is  mapped  onto  itself,  and 

(3)  each  tpi  function  in  A^  is  unchanged. 


Hence  the  event  Ai ,  and  the  null  probability  of  Ai,  is  unaffected  over 

l 

all  such  relabel ings  in  0 

The  objective  at  this  point  is  to  establish  the  conditional  null 
probability  of  any  <f>2 -ordering  on  0 j  given  event  A]  has  occurred.  Since 
the  null  probability  of  A\  is  not  changed  urder  any  relabeling  in  0 j. 


Therefore,  it 
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then  it  is  not  changed  over  all  'V -orderings  on 
suffices  to  determine  the  joint  null  probability 

Pi  =  PIZ!  i  2i.  z2  1  22... -.7.*  .  1  z  ,  Ai] 

1  —  2  —  tl+ll-1  —  t  1  +  J  1~X 

for  any  set  of  real-component  vectors  zi,Z2,...,z  . .  Now,  pi  is 

- pipl 

actually  a  joint  probability  of  functions  involving  all  random  vector' 

in  0  .  since  I i,  hence  6\(Z,I\)  is  defined  on  0  ,  . 
n+m  im:i 

If  pi  can  be  shown  to  be  symmetric  in  Zi,Z2,...,z  then  the 

t i+i i-l 

joint  conditional  null  probability 

P[Z;  <2,.  7.‘  <  . . i  2t,+i,-l  !A‘l 

would  be  a  symmetric  function  in  zi,Z2»....z,  ,.  Then  by  applying 

1 1+1  l-J. 

Lemma  1-2  it  can  be  shown  that  all  (Jg -orderings  on  given  Ai  are 
equally-likely .  Then  the  determination  of  the  frequency  count  distribution 
within  one  of  the  two  new  blocks  formed  at  the  second  stage  given  the 
event  A]  can  be  made. 

First,  to  show  that  pi  is  indeed  symmetric  in  Zi,Z2,...,z  ..  , 

ti+ii-1 

an  application  of  Lemma  1-1  will  be  made.  Using  the  notation  established 
in  Lemma  1-1  (also  replacing  the  W  ’s  with  z'’s  and  setting  q  =  n+m)  de¬ 
fine  the  following  K=n+m+ti+ij-2  functions: 


i 


j.(Zi,Z2,...,Zn+m)  -  { 


*1(zi-t,-i,+l 
<h(7* 


i-2t i — 2 i i+2 


i=l ,2, . . . ,t]+  ii-  1 
,Ji)  i=t i+i i , . . . , 2t i+2i i -2 

,/i)  i=2t i+2i i-l , . . . ,n+m+t i+i i-2 


Appealing  directly  to  Lemma  I-l,  it  follows  pi  is  symmetric  in  zi,Z2>...» 

z  ...  Thus  the  joint  conditional  null  distribution 
ti+H-1  J 
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Ail 


P[Z  <  Zl,  z:  <  i;.,  . . . , 

is  svmmetiic  in  zi.zj . ^ 

ti+i}-l 

Now  apply  Lemma  1-2  (replacing  tin  ..'.'s  with  the  conditional  zl's 

i  l 


given  Aj ,  set  t 


ti  +  ii  -  i,  un!  ,  ,r  2)  ~  g(Z, /•’)).  This  proves 


that  the  joint  conditional  null  distribution  of  ! .  (  ,/•>),  <t>2  (7.\  5/2)  , .  . .  , 

<p2  (Z  1  >-2*2 )  given  A]  is  a  symmetric  funct  ion.  Therefore,  all  possible 

ti+Xj-T 

<j>2-orderings  on  Oj  given  Ai  are  unique  (by  choice  o;  s,‘'?)  and  equally- 

likely . 

Let  S2  denote  the  set  of  all  4 ,, -orderings  on  ■!  given  Aj.  Then  each 

element  in  S2  has  a  conditional  mil  !  probability  of  l/(ti+  i  1  -  1).’  of 

occurring.  Then  the  conditional  ncil  ni obnb i 1 i f y  that  exactly  s j  obser- 

vations  on  0  fall  within  B.  riven  that  ti  observa; ions  on  0  fell  within 
m  1  in 

Bj  is  exactly  the  same  as  the  null  probability  of  observing  an  clement 

in  S2  which  assigns  tfie  (si  f  i-)n*  -Ir-order  position  to  0  and  exactly 

n 

Sj  of  the  first  si  +  i2  -  1  wa-ordcr  positions  to  . '  .  By  the  same  type 

’•I 

argument  used  at  the  first  stage,  this  prohnhilit  becomes 
P(s  1 J 1 1 )  =  (si+i2  — i-l-i?)/(t)  +  i .-l) 

Si  -  1  —  s  j  t;  1 

for  si  =  0,1,.  .  .  ,t  1  • 

Multiplying  by  P(ti),  derived  in  stage  1,  and  sub  finning  ti-si  =  sr 
and  m  -  1 1  =  S3,  then  the  joint  null  distribution  of  si,s:  ,  S3,  the 
respective  block  frequency  counts  on  ,  is 

P(S1,S2,S3)  =  (  )(  ) (  )/(  ) 

SI  s?  S3  ’> 


for  non-negative  integers  , 


1,7,1  such  that  s  i  4  sj  +  S3  -  m. 
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Similarly,  if  block  B2  had  been  chosen  for  division,  the  J2  would 

contain  at  least  all  information  symmetric  with  respect  to  B2  .  Then  for 

i2  £  I  .  and  <J>2(Z,72)  the  new  blocks  formed  would  be  a  division  of  B* 
n-i  z 

2  2  1  2 
into  B2  and  B3  where  the  first  stage  block  B3  would  be  designated  as  Bj. 

There  are  ii-1,  i2-l,  and  n-i.2  observations  in  o^  tailing  within  the 
respective  blocks.  Then  if  si,s2,S3  denote  the  respective  block  fre¬ 
quency  count  on  0^,  their  joint  null  distribution  would  become 

P(sx,s2,s3)  =  (  „  )(  )(  )/(  ) 

si  s2  S3  m 

for  all  non-negative  integers  s^,  i  =  1,2,3  such  that  sj  +  s2  +  S3  =  m. 

The  information  now  available  for  starting  the  third  stage  in  B* 

consists  of  J2,  c2 ,  x*,  and  the  unidentified  observation  sets  o,,  o2 , 

-2 

and  o3  if  all  blocks  are  to  be  further  divided  at  later  stages.  Other¬ 
wise,  if  the  decision  were  made  at  stage  2  to  never  divide  any  one  (or 

2  2  2 

two)  of  the  blocks  Bj,  B2 ,  B3  at  any  later  stage,  then  the  observations 
set(s)  associated  with  the  block (s)  selected  can  be  identified  com¬ 
pletely.  This  information  is  symmetric  on  the  remaining  blocks  and 
would  then  be  made  available  at  the  start  of  stage  3.  It  should  be  not'd 
also,  that  the  information  J2  contains  all  information  available  at  the 

start  of  stage  2,  namely:  1 1,  cj,  x*  and  the  unidentified  observation 

- 1  -1 

sets  Oj  and  o2 . 

In  the  r*"^  stage  (r  _<  n)  in  the  process  P*  let  the  blocks 


be  formed  and  e^,  i  =  l,2,...,r+l  denote  the  number  of  observations  in 
o^  lying  within  the  respective  blocks.  Assert  that  the  joint  null  dis- 
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tribution  of  mi ,m2 , . . . ,m  the  respective  block  frequency  counts  on 


0  is 
m 


P(mi,m2,...,m  )  =  {  H  (mi  *  ei)}/(m^) 

r+l  l—i  m,  m 


for  all  non-negative  integers  nr  ,  i  =  1,2,..., r+l  such  that  mi  +  m2  +. . . 

+m  =  m.  This  assertion  is  verified  by  mathematical  induction, 
r+l 

The  fact  that  this  assertion  holds  for  r  =  1,2  has  been  shown  above. 

s  t 

Now,  assume  it  holds  true  for  the  (r-1)  stage  and  all  previous  stages. 

Lei.  \  B*  \...,B*  1  denote  the  blocks  formed  at  the  (r-l)st  stage  and 

—  r_l 

h.  is  the  number  of  observations  in  o  contained  in  block  BJ  ,  1=1, 2,..., r. 
1  n  i 

Then  if  . . s^  denote  the  respective  block  frequency  counts  on  0^, 

by  assumption,  the  joint  null  distribution  of  Si,S2,...,s  is 

P(S1,S2,...,S  )  =  {  n  (si  ^  hi)}/(mtn) 

r  1=1  s .  m 

1 

for  all  non-negative  s^  ,  i  =  l,2,...,r  such  that  si  +  s2  +. ,,+s^  =  m. 

The  information  available  at  the  start  of  the  r^  stage  consists 

of  J  , ,  c  , ,  x*  , ,  and  the  two  new  unidentified  observation  sets  ob- 
r-1  r-1  r-1 

s  t 

tained  on  the  two  new  blocks  formed  at  the  (r-1)  stage.  Now, 
contains  all  information  that  was  available  at  all  previous  stages.  In 
particular,  ?  r_^  J~r_2  3  * » »  3 I\ •  Thus,  if  any  blocks  formed  at  r  ome 
previous  stage  were  chosen  to  never  be  divided  in  B*  then  the  correspond¬ 
ing  identifi'-d  observation  sets  is  information  contained  in  Further¬ 

more,  the  information  on  the  identified  observation  sets  was  symmetric 

with  respect  to  all  blocks  which  were  divided  at  later  stages,  and  hence, 

s  t 

symmetric  with  respect  to  all  blocks  available  for  division  at  the  (r-1) 

,  th 

and  r  stages. 
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Now  suppose  block  L  (for  some  j  =  l,2,...,r)  is  available  and 

selected  for  division  at  the  stage. 

Then  determine  X^-containing  all  information  which  is  symmetric 
r— 1 

with  respect  to  and  symmetric  separately  with  respect  to  all  blocks 

in  the  set  {B^  B^  that  are  intended  to  be  decomposed  at 

some  later  stage. 


Using  X  ,  select  i  e  and  a  real-valued  measurable  function 

j  r-1 

4>  (Z,J  )  either  having  a  continuous  null  distribution  for  Z  c  0  or  is 
r  r  j 

_r  n 

such  that  there  are  no  ties  within  the  set  of  values  { d>  (z ,X  )|z  £  o.  -L} 

r  r  1  j 

Let  x"  c  o  H  oT  ^  be  such  that  d>  (x*,I  )  =  c  is  the  i 
r  n  j  r  r  r  r  r 

_  j 

value  in  the  set  {({'^(x,!^)  |  x  £  o^  }.  The  cutting  function  =  c ^ 


largest 


divides  the  block  B 


r-1 


into 


(x  c  Br  1|(j>  (x,X  )  <  c  ] 
j  1 Yr  r  r 


and 


1 

,r 


B‘  =  (x  F  Br_1](J)  (x,X  )  >  c  } 
j+1  j  1  r  r  r 

For  consistency  in  notation,  the  remaining  blocks  defined  at  the 


.st 


(r-1)" "  stage  are  relabeled: 


B[  =  1  i  =  1,2, .. . , j-1 


and 


Br  =  i  =  j  +  2 . r  +  1 

l  i-l  J 

s  t 

The  event  A^__j  that  the  respective  block  frequency  counts  at  the  (r-1) 

stage  was  si,S2,...,Kr  is  determined  by  considering  the  following  facts. 

&  r-1 

In  the  construction  process  V  ,  each  block  i  =  l,2,...,r  was 

s  t 

originally  established  either  at  the  (r-1)  stage  or  some  earlier  stage 
by  dividing  some  block  previously  established,  and  each  of  those  blocks 
were  formed  by  dividing  some  block  established  yet  earlier,  etc.  For 


i 

i 


J 


-/a- 


r-"  1 

each  block  Ik  consider  only  those  stages  in  the  process  B*  for  which 

r*~l 

one  of  the  two  blocks  newly  formed  at  the  stage  contains  the  block  Ik 
r  —  2 

Then  each  block  Ik  can  be  associated  with  a  unique  subset  of  integers 

in  {1,2, . • . ,r-l}  such  that  each  integer  within  the  subset  represents  a 

stage  level  in  which  two  new  blocks  were  defined  (from  the  set  of  blocks 

established  at  previous  levels),  one  of  which  contains  block  b  If 

r-l 

i ,  were  in  the  subset  associated  with  block  B.  then  either 

j  x 

.r-l 


B.  C  (x  e  X 1 .  (x,I,  )  <  c.  ) 
i  1  31  ji  Ji 


or 


Br  1  C  {x  c  X | <{> .  (x,J.  )  >  c.  }  . 
r  1  Ji  3i  31 

Thus  the  subset  of  integers  associated  with  B^  can  be  partitioned  into 
two  unique  subsets  {aj  (i  )  ,32  (i)  , .  .  .  >au^j  (i)  }  and  {b  i  (i)  ,b2(i)  , . .  .  ,bv^j  (i)  ) 

r-l 

such  that  the  block  B.  is  defined  by 

i 

Bi  1  =  tx  C  '•biai(i)(x’Ji)  <  cai(i)’  4>a2(i)(x’7i)  <  Ca2(i) . 


i  ,.,(i)(x’7i)  <  Ca  . . .  (i)  ’  <}>bi(i)(x’Ji)  >  Cb  iCi)*""** 

u(i)  u(i) 


<b  ’  %  ... (i)}  for  1 ' 1>2 . r- 

v(l)  v(l) 


As  before  define  =  {Zj,Z2 . Z1  +h  }  as  the  set  of  random  vectors 

1  _i  i  i 

yielding  observations  in  block  B^  ,  i  =  l,2,...,r.  Then  consider  the 

s  t 

events  defined  at  the  (r-l)  stage 


0,  b1.-)  <  (i) '‘'Jk,Ji/  '  '‘a  ,,v(i) 


l»k 


ii(i)'k,Ji'  ai  (i) ' 


and 

D+  .  =  4.  ,.*(Z.\l.)  >  ci  .  m(Zk,7i)  >  %  (i)J 

i,k  b  i  (  j  )  k  i  3ii(i)  bv(i)  k  1  v(i)'1  ; 


for  k  =  l,2,...,s.  +  Ik  and  i  =  l,2,...,r. 


Then  the  event  of  obtaining  s^  observations  in  in  a*  is  equivalent 

co  the  event  that  L)^  ^  and  D*  ^  hold  simultaneously  fur  k  =  1,2,..., 

s^  +  h..  This  event  can  be  expressed  as 

s  ,+h . 

Ai  =  k^l  fDi,k  n  Di,kJ* 

The  event  of  obtaining  si,S2,. ..,s  observation  in  o  in  the  blocks 

r  m 

\b2  \...,Br  1  ,  respectively,  is 


A  ,  =  .nr 

r-1  i=l  i 

Now,  consider  the  random  vectors  (Z j ,Z2 , . . . ,Z^  )  =  0^  ^  which 

r-1  Sj  j  i 

yield  observations  in  B.  .  The  information  sets  I \ ,1% , . . . tI are  de- 

X*-l  x*—l 

fined  to  be  symmetric  on  B .  ,  otherwise  B.  would  not  be  available  for 

J  J 

cutting  at  the  rtl'  stage.  Then  the  (^-functions  within  the  events  ^ 
for  i  =  l,2,...,r  and  i  /  j  are  unchanged  over  any  relabeling  within 

r—  1  r— 1 

0 ,  thus  the  events  are  unchanged.  The  event  A^.  is  mapped  onto  itself 

r-l 

by  relabeling  within  0.  .  Hence,  A^  ^  and  the  probability  of  A^^  is 

r-1 

unaffected  by  any  such  relabeling  within  0 ^  .  Then  the  joint  null 


probability 


pl/.|  <  z;,  Z\  <  z2,...,Z^  +h  <  *  +h  ,  Ar-1] 

3  1  J  1 


is  symmetric  in  the  vectors  zi,Z2,...,z  by  Lemma  1-1.  Therefore, 

J  J  j  j  j 

the  joint  conditional  null  cumulative  distribution  of  Zj,7.2,...,Z  . , 

3  3 

given  A  ^  is  symmetric  in  the  vectors  . . Zs  +h  ’  ^en  ky  applv- 

jSj  j  j 

ing  Lemma  1-2  the  joint  null  distribution  of  <j>^(Z "J ,  1^)  ,  <f>r (Z ^  ,  1^)  ,  .  .  . , 

<fc  (y 1  ,]  )  given  A  ,  is  found  to  be  symmetric.  It  follows  that  all 

l  s.+li.  r  r-1 

J  •'  r-1 

possible  2^ -ordering;;  on  0  are  equally-likely .  Using  the  same  type 


-80- 


arguinent  employed  before,  he  <  oMclit-ion.il  mill  prob.d  i  1  i  t  y  that  n . 

observations  in  o  fall  w !  t  bin  Meek  I:,  given  tlu-  Mock  f  requencv  counts 
in 

si  »S2, . . .  ,s  determined  at  the  (r-l)'L  .-ye  be com. 


P(m. jsi ,s2 .... ,sr) 


/  m .  -1-  i  ,  \  /  s . -m ,+h . - i  ■  Its  .  +  h 
J  r-1  !  .1  J  1  r  /  J  .1 


"j  ) 


- .  -in .  / 

.1  .1  ' 


si  ! 


for  m^.  =  0,1, . . .  ,Sj. 

observe  that  (in  terms  of  the  notation  defined  for  the  rf^  stage)  the 
following  equalities  hold: 


e.  =  h.  and  m.  •=  s. 
it  ii 


for  i  =  1,2 . j-1  , 


e.  =  h.  ,  and  tn.  =  s.  for  i  =  j+1 . r+1  . , 

l  l-l  j.  l-l 

m.  +  m.,,  =  s.,  i  -j  =  e.,  and  h .  -  i  =  e 

J  .1+1  3  r  j’  3  r  j+1 

Multiplying  the  above  conditional  null  probability  by  the  joint  probability 
P(s  i ,  s2  , .  .  .  ,s  )  and  using  the.  above  equalities,  the  joint  null  distribu¬ 
tion  or  m i  .  .m  becomes 

r  •  i 


P(mi ,m2 , . . . ,mr+1) 


r+1  m .  +  e . 

{  n  (  1  1))/(m  n) 

,  ,  m .  m 

1=1  l 


for  all  non-negative  rre,  i  =  1,2,...,  r+1  such  that  rq  +  r.i2  +.  .  •+mr+j  =  m* 
This  completes  the  proof  of  the  assertion. 

Now  by  the  definition  given  in  Chapter  IV,  tin-  basic  Mod  are 

the  n  +  1  statistically  equivalent  blocks  obtained  if  the  process  P* 

could  be  continued  through  the  n*'1  stage.  In  this  case,  each  F>_.  would 

-  r 

be  further  divided  until  all  the  observations  in  o  lying  in  B  were 

n  i 

consumed.  Thus,  if  block  Br  contained  e.  observations  in  o  then  there 

J  l  u 


would  eventually  be  e.  +  1  basic  blocks  formed  within  B^.  Substituting 
k.  *  e  +  1  for  i  =  l,2,...,r+l  in  the  above  probability  expression 
gives  the  desired  results. 
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